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(54) Data storage medium, and apparatus and method for reproducing the data from the same 



(57) A data storage medium having stored thereon 
includes an audio sequence including at least one audio 
stream; at least one video stream; and- reproduction 
control information for controlling reproduction of the at 
least one audio stream and the at least one video 
stream. The reproduction control information includes 
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reproduction order information defining the order of at 
least one video stream, among the at least one video 
stream stored on me data storage mediunvwhich is- to 
be reproduced in synchronization with the audio 
sequence. 
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Description 

BACKGROUND OF THE INVENTION 

1 . FIELD OF THE INVENTION: 5 

[0001 ] The present invention relates to a data storage 
medium having still picture data and audio data stored 
thereon, and an apparatus and method for reproducing 
the still picture data along with high quality audio data. 10 

2. DESCRIPTION OF THE RELATED ART: 

[0002] Conventionally known optical disks which 
music information including audio data or moving pic- 75 
ture information is recorded to and reproduced from 
include compact disks (CDs) and laser disks (LDs). 
Compact disk recording and reproduction apparatuses 
and laser disk recording and reproduction apparatuses 
are conventionally known apparatuses for recording 20 
and reproducing the above-mentioned information. 
[0003] A CD is an optical disk having a diameter of 12 
cm for storing music information digitally recorded using 
an encoding technology referred to as linear PCM. CDs 
have been utilized as a medium for storing data for 25 
music applications. An LD is an optical disk having a 
diameter of 30 cm for storing moving picture information 
with audio data recorded as an analog signal. LDs have 
been utilized as a medium for storing data for video 
applications such as movies. 30 
[0004] Recently, optical disks having a diameter of 
about 12 cm for storing music information and moving 
picture information including audio data digitally 
recorded. On such optical disks, either one or both of 
the music information and moving picture information 35 
are efficiently recorded in the state of being digitally 
compressed in order to acquire longer recording times, 
and higher quality. Some of such optical disks have a 
file structure which facilitates data exchange with com- 
puters and communication apparatuses. 40 
[0005] In order to meet demands for mediums having 
higher quality recording and reproduction of audio infor- 
mation than CDs, the DVD-Video technology has been 
developed and put into practice for realizing high quality 
audio data reproduction of linear PCM, 96 kHz and 24- 45 
bit sampling. However, the DVD-Video standards can- 
not provide high quality linear PCM multi-surround 
audio data reproduction or higher quality audio data 
reproduction. Accordingly, a higher quality technology 
has been in demand. However, the amount of high qual- so 
ity audio data is quite large and thus the reproduction 
speed of data required is quite high. Thus, it is difficult to 
record and reproduce image data along with the high 
quality audio data. 

[0006] In the case of reproducing an audio title, there ss 
is a demand that the image data be used as supplemen- 
tal data to the audio data as, for example, a so-called 
jacket picture such as lyrics and figure of the player. In 
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order to realize this, recording and reproduction appara- 
tuses having more flexible and a wider variety of func- 
tions are required. The functions include reproduction of 
video data synchronously with high quality audio data, 
reproduction of video data asynchronously from audio 
data, manipulation of video data by key operation, and 
special effects at the time of video data switching. 
[00071 There is another demand that data be repro- 
duced as the title creator intended. For example, the ini- 
tial operation when the disk is inserted should not be 
determined by the system used, and direct selection of 
the audio data should be permitted or prohibited in 
accordance with the intention of the title creator regard- 
less of the type of the player. 

[0008] Conventional optical disks, and recording and 
reproduction apparatuses generally have the above- 
described structure and thus have a problem in that it is 
difficult to record and reproduce high quality digital 
audio data along with video data. When the transfer 
speed of the high quality audio data is approximately 
equal to the reading speed of the optical disk, the video 
data cannot be reproduced along with the audio data by 
the method of multiplexing the audio data and the video 
data used in conventional apparatuses. The reason for 
this is that when the audio data and the video data are 
multiplexed, the transfer speed of the resultant data 
exceeds the reading speed of the optical disk and thus 
the continuous reproduction of such data cannot be 
guaranteed. The method, often used in computer appli- 
cations, of appropriately buffering audio data and video 
data so as to simultaneously reproduce the audio data 
and the video data which are recorded on different 
areas of the optical disk cannot be used. The reason for 
this is that the sum of the transfer speeds of the audio 
data and the video data needs to be equal to or lower 
than the reproduction speed of the optical disk in order 
to continuously reproduce the data recorded on disks 
such as CDs tor an hour or more. 
[0009] A BrowsaWe reproduction method of switching 
the video data based on an instruction by the user while 
continuously reproducing the audio data, or a Slide- 
Show reproduction method of automatically switching 
the video data in synchronisation with a specified posi- 
tion in the audio data are not usable. 

SUMMARY OF THE INVENTION 

[0010] According to one aspect of the invention, a 
data storage medium having stored thereon includes an 
audio sequence including at least one audio stream; at 
least one video stream; and reproduction control infor- 
mation for controlling reproduction of the at least one 
audio stream and the at least one video stream. The 
reproduction control information includes reproduction 
order information defining the order of at least one video 
stream, among the at least one video stream stored oh 
the data storage medium, which is to be reproduced in 
synchronization with the audio sequence. 
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[001 1 ] In one embodiment of the invention, the repro- 
duction control information further includes video repro- 
duction mode information indicating whether it is 
possible or not to change the order of the at least one 
video stream to be reproduced in synchronization with 
the audio sequence based on interaction from a user. 
[001 2] In one embodiment of the invention, the repro- 
duction control information further includes video repro- 
duction mode information indicating whether 
reproduction timing of the at least one video stream to 
be reproduced in synchronization with the audio 
sequence is defined or a reproduction time period 
thereof is defined. 

[001 3] In one embodiment of the invention, the repro- 
duction control information further includes reproduc- 
tion order mode information defining whether the at 
least one video stream, the order of which is defined by 
the reproduction order information, is reproduced ran- 
domly or sequentially, 

[0014] In one embodiment of the invention, the repro- 
duction order mode information indicates whether the at 
least one video stream is reproduced randomly while 
permitting overlap or randomly while prohibiting overlap. 
[001 5] In one embodiment of the invention, the audio 
sequence corresponds to at least one program, and the 
at least one video stream stored on the data storage 
medium is grouped on a program-by-program basis. 
[001 6] According to another aspect of the invention, 
an apparatus for reproducing information from a data 
storage medium having stored thereon an audio 
sequence including at least one audio stream ; at least 
one video stream; and reproduction control information 
for controlling reproduction of the at least one audio 
stream and the at least one video stream, wherein the 
reproduction control information includes reproduction 
order information defining the order of at least one video 
stream, among the at least one video stream stored on 
the data storage medium, which is to be reproduced in 
synchronization with the audio sequence, the apparatus 
includes a reading section tor reading the audio 
sequence and the at least one video stream stored on 
the data storage medium; and a reproduction section for 
reproducing the at least one video stream in synchroni- 
zation with the audio sequence in the order defined by 
reproduction order information. 

[001 7] In one embodiment of the invention, the repro- 
duction control information further includes video repro- 
duction mode information indicating whether it is 
possible or not to change the order of the at least one 
video stream to be reproduced in synchronization with 
the audio sequence based on interaction from a user. 
[0018] In one embodiment of the invention, the repro- 
duction control information further includes video repro- 
duction mode information indicating whether 
reproduction timing of the at least one video stream to 
be reproduced in synchronization with the audio 
sequence is defined or a reproduction time period 
thereof is defined. 



[001 9] In one embodiment of the invention, the repro- 
duction control information further includes reproduc- 
tion order mode information defining whether the at 
least one video stream, the order of which is defined by 
5 the reproduction order information, is reproduced ran- 
domly or sequentially. 

[0020] In one embodiment of the invention, the repro- 
duction order mode information indicates whether the at 
least one video stream is reproduced randomly while 

w permitting overlap or randomly while prohibiting overlap. 
[0021 ] In one embodiment of the invention, the at least 
one video stream to be reproduced in synchronization 
with the audio sequence is buffered into a buffer located 
in the reproduction apparatus before reproduction of the 

75 audio sequence. 

[0022] According to still another aspect of the inven- 
tion, a method for reproducing information from a data 
storage medium having stored thereon an audio 
sequence including at least one audio stream; at least 

20 one video stream; and reproduction control information 
for controlling reproduction of the at least one audio 
stream and the at least one video stream, wherein the 
reproduction control information includes reproduction 
order information defining the order of at least one video 

25 stream, among the at least one video stream stored on 
the data storage medium, which is to be reproduced in 
synchronization with the audio sequence, the method 
includes the steps of reading the audio sequence and 
the at least one video stream stored on the data storage 

30 medium; and reproducing the at least one video stream 
in synchronization wim the audio sequence in toe order 
defined by reproduction order information. 
[0023] In one embodiment of the invention, the repro- 
duction control information further includes video repro- 

35 duction mode information indicating whether it is 
possible or not to change the order of the at least one 
video stream to be reproduced in synchronization with 
the audio sequence based on interaction from a user. 
[0024] In one embodiment of the invention, the repro- 

40 duction control information further includes video repro- 
duction mode information indicating whether 
reproduction timing of the at least one video stream to 
be reproduced in synchronization with the audio 
sequence is defined or a reproduction time period 

45 thereof is defined. 

[0025] In one embodiment of the invention, In one 
embodiment of the invention, the reproduction control 
information further includes reproduction order mode 
information defining whether the at least one video 

so stream, the order of which is defined by the reproduc- 
tion order information, is reproduced randomly or 
sequentially. 

[0026] In one embodiment of the invention, the repro- 
duction order mode information indicates whether the at 
55 least one video stream is reproduced randomly while 
permitting overlap or randomly while prohibiting overlap. 
[0027] In one embodiment of the invention, the at least 
one video stream to be reproduced in synchronization 
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with the audio sequence is buffered into a buffer located 
in the reproduction apparatus before reproduction of the 
audio sequence. 

[0028] According to still another aspect of the inven- 
tion, in a data storage medium stored with a first system 
stream and a second system stream to be reproduced 
in synchronization with each other, the first system 
stream includes reproduction control information indi- 
cating timing at which the first system stream js repro- 
duced, and the second system stream is reproduced at 
timing determined by information other than information 
included in the second system stream. 
[0029] According to still another aspect of the inven- 
tion, an apparatus for reproducing a data storage 
medium stored with a first system stream and a second 
system stream to be reproduced in synchronization with 
each other, the first system stream includes reproduc- 
tion control information indicating timing at which the 
first system stream is reproduced includes a reading 
section for reading the first system stream and the sec- 
ond system stream stored on the data storage medium; 
and a decoder section for decoding the first system 
stream in accordance with the reproduction control 
information. The second system stream is decoded in 
accordance with a control signal supplied from outside 
of the decoder section. 

[0030] According to still another aspect of the inven- 
tion, a method for reproducing a data storage medium 
stored with a first system stream and a second system 
stream to be reproduced in synchronization with each 
other, wherein the first system stream includes repro- 
duction control information indicating timing at which the 
first system stream is reproduced includes the steps of 
reading the first system stream and the second system 
stream stored on the data storage medium; and control- 
ling the timing at which the first system stream is repro- 
duced in accordance with the reproduction control 
information; controlling the timing at which the second 
system stream is reproduced in accordance with infor- 
mation other than information included in the second 
system stream. 

[0031] Thus, the invention described herein makes 
possible the advantages of providing (1) an optical disk 
for realizing reproduction of high quality digital audio 
data along with video data in a restricted range of bit 
rates; (2) an optical disk, and an apparatus and method 
for reproducing data from such an optical disk for realiz- 
ing switching between a mode in which high quality dig- 
ital audio data and video data are reproduced 
synchronously with each other and a mode in which 
high quality digital audio data and video data are repro- 
duced asynchronously from each other and the video 
data is switched based on time or by an instruction from 
the user; and (3) a reproduction apparatus for allowing 
a decoder used for a general MPEG2 stream to be used 
for reproducing audio data and video data as independ- 
ent streams and thus for realizing reproduction of high 
quality digital audio data along with video data in a 
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restricted range of bit rates at a relatively low cost. 
[0032] These and other advantages of the present 
invention will become apparent to those skilled in the art 
upon reading and understanding the following detailed 
5 description with reference to the accompanying figures. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0033] 

10 

Figure 1 A is an external view of an optical disk in a 
first example according to the present invention; 

Figure 1B is a cross-sectional view of the optical 
75 disk shown in Figure 1 A; 

Figure 1 C is an enlarged cross-sectional view of the 
optical disk shown in Figure 1 A; 

Figure 1D shows pits formed in the optical disk in 
the first example according to the present invention; 

Figure 2A shows a track structure of the optical disk 
shown in Figure 1 A; 

Figure 2B shows a sector structure of the optical 
disk shown in Figure 1 A; 

Figure 3 shows a logical structure of the optical disk 
shown in Figure 1 A; 

Figure 4 shows a data structure of an audio man- 
ager; 

Figure 5 shows a data structure of an audio title set; 

Figure 6 shows an exemplary structure of an audio 
object (AOB); 

Figure 7 shows a data structure of PGC information 
and cell information; 

Figure 8 is a block diagram showing an internal 
structure of a DVD player, which is a reproduction 
apparatus; 

Figure 9 shows an example of PGC forming a title; 

Figure 10 shows an example of program informa- 
tion; 

Figure 1 1 shows an example of cell information; 

Figure 12A shows a process for reproducing audio 
data without displaying a still picture; 

Figure 12B shows a process for reproducing audio 
data without displaying a still picture; 
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Figure 13A shows a process for reproducing audio 
data while displaying a still picture; 

Figure 13B shows a process for reproducing audio 
data while displaying a still picture; s 

Figure 14 shows a conventional process for repro- 
ducing audio data; 

Figure 1 5 shows a schematic flow for reproducing a 10 
program; 

Figure 16 shows a schematic flow for reproducing a 
leading audio cell; 

15 

Figure 17 shows a flow for reproducing a silent cell; 

Figure 18 shows a flow for reproducing a continu- 
ously reproduced cell; 

20 

Figure 19 is a DVD player and peripheral devices 
connected thereto; 

Figure 20 shows a remote controller used for oper- 
ating the DVD player; 25 

Figure 21 shows a logical structure of a DVD as a 
multimedia optical disk in a second example 
according to the present invention; 

30 

Figure 22 shows a structure of an audio manager; 

Figure 23 shows a data structure of an AOB of the 
DVD in the second example; 

35 

Figure 24 shows an exemplary structure of the 
RQB; 

Figure 25 shows a data structure of PGC informa- 
tion of the DVD in the second example; 40 

Figure 26 is a block diagram showing an internal 
structure of a DVD player in the second example; 

Figure 27 is a block diagram showing an internal 45 
structure of a DVD player including a system 
decoder for AOB and P_VOB; 

Figure 28 is a block diagram showing an internal 
structure of a DVD player including a P_VOB buffer so 
immediately before an AV decoder section; 

Figure 29 shows the relationship among PGC, 
audio program, cell, picture program, picture cell, 
AOB and P_VOB; ss 

Figure 30 shows an example of program informa- 
tion forming a title; 
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Figure 31 shows a P_VOB of the DVD in the second 
example; 

Figure 32 shows an exemplary audio program; 

Figure 33 shows another exemplary audio program; 

Figure 34 shows the number of silent cell packs and 
audio cell packs of audio data of 48 kHz, 16-bit 
sampling and 2 channels; 

Figures shows picture cell information when the 
number of still picture packs is 50; 

Figures 36A and 36B show methods for reproduc- 
ing a silent cell and an audio cell; 

Figure 37 is a flowchart illustrating a method for 
producing one of programs; 

Figure 38 is a flowchart illustrating a method for 
producing one of the programs; 

Figure 39 is a flowchart illustrating a method for 
producing one of the programs; 

Figure 40 shows a video menu for allowing the user 
to select a title; 

Figure 41 shows a structure of P_PCI; 

Figure 42 shows a structure of P_DSI; 

Figure 43 shows a logical structure of an optical 
disk in a third example according to the present 
invention; 

Figure 44 shows a data structure of an audio man- 
ager (AMG); 

Figure 45 shows a data structure of an audio still 
video set (ASVS); 

Figure 46 shows a data structure of P_VOB: 

Figure 47 shows a data structure of highlight infor- 
mation; 

Figure 48 shows a data structure of an audio title 
set (ATS); 

Figure 49 shows a data structure of PGC informa- 
tion (ATS_PGCI); 

Figure 50 illustrates an audio still video display 
mode; 

Figure 51 shows a data structure of an MPEG sys- 
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tern stream; 

Figure 52 shows the relationship among P_VOB, 
ASVU. ATS.PGCI and AOB; 

Figure 53 shows an example of PGC information; 

Figure 54 shows an example of audio program 
information (ATS_PGI); 

Figure 55 shows an example of ATS audio still 
video reproduction information (ATS_ASV_PBI); 

Figure 56 shows an example of cell information; 

Figure 57 shows an example of ASV search 
pointer; 



Figure 69 illustrates the relationship between 
DLIST and P_VOB. 

DESCRIPTION OF THE EMBODIMENTS 

5 

[0034] Hereinafter, the present invention will be 
described by way of illustrative examples with reference 
to the accompanying drawings. 

10 (Example 1) 

[0035] An optical disk in a first example according to 
the present invention will be described. 

75 (1) Physical structure of the optical disk 



Figure 58 shows an example of ASVU general 
information; 20 

Figure 59A shows the relationship between the 
audio reproduction timing and the still picture dis- 
play timing in the "SlideShow" mode; 

25 

Figure 59B shows the relationship between the 
audio reproduction timing and the still picture dis- 
play timing in the "Browsable" mode; 

Figure 60 illustrates an audio still video display so 
mode; 

Figure 61 is a block diagram illustrating an internal 
structure of a DVD player as a reproduction appara- 
tus; 35 

Figure 62 is a flowchart illustrating a method for 
reproducing PGC information; 

Figure 63 is a flowchart illustrating a method for 40 
reproducing audio program; 

Figure 64 is a flowchart illustrating a method for 
determining key operation; 

45 

Figure 65 is a flowchart illustrating a method for 
reproducing audio data; 

Figure 66 is a flowchart illustrating a method for 
reproducing a still picture; so 

Figure 67 is a block diagram illustrating an internal 
structure of another DVD player as a reproduction 
apparatus; 



55 



Figure 68 is a block diagram illustrating an internal 
structure of still another DVD player as a reproduc- 
tion apparatus; and 



[0036] Figure 1 A shows an external appearance of a 
DVD 107, which is a multimedia optical disk. Figure 1B 
shows a cross-section of the DVD 107 taken along line 
A- A* in Figure 1 A. Figure 1C is an enlarged cross-sec- 
tional view of portion B shown in Figure 1B. 
[0037] As shown in Figure 1B, the DVD 107 is formed 
by sequentially laminating a first transparent substrate 
108, an information layer 109, an adhesive layer 110, a 
second transparent substrate 111, and a print layer 112 
on which a label is to be printed in this order. 
[0038] The first transparent substrate 108 and the 
second transparent substrate 111 are reinforcement 
substrates formed of the same material. In the example 
shown in Figure 1 B, these substrates each have a thick- 
ness of about 0.6 mm. These substrates can each have 
a thickness in the range between about 0.5 mm and 
about 0.7 mm. 

[0039] The adhesive layer 110 is provided between 
the information layer 109 and the second transparent 
substrate 111 in order to adhere the information layer 
1 09 to the second transparent substrate 111. 
[0040] On the face of the information layer 1 09 in con- 
tact with the f irst transparent substrate 108 is formed a 
reflective film (not shown) formed of a thin metal film or 
the like. By using a molding technique, a high density of 
convex and concave pits are formed in the reflective 
film. 

[0041] Figure 1 D shows the shapes of pits formed in 
the reflective film. In the example shown in Figure 1D, 
each pit has a length of 0.4 jim to 2.1 3 tun. A spiral track 
is formed in the DVD 107. The pits are formed alongside 
the spiral track so as to be at a distance of 0.74 from 
the spiral track in a radial direction of the DVD 107. 
Thus, an array of pits are formed along the spiral track 
[0042] When a light beam 113 is directed toward the 
DVD 107, a light spot 114 is formed on the information 
layer 1 09 as shown in Figure 1 C. The information stored 
on the DVD 1 07 is detected as a variation in the reflect- 
ance of the portion of the information layer 119 which is 
irradiated by the light spot 114. 
[0043] The diameter of the light spot 1 14 of the DVD 
107 is about 1/1.6 times the diameter of a light spot on 
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a CD because the numerical aperture (NA) of an object 
lens for DVDs is larger than the numerical aperture of an 
object lens for CDs and because the wavelength X of a 
light beam for DVDs is shorter than the wavelength X of 
a light beam for CDs. 

[0044] A DVD having the above<lescribed physical 
structure can store about 4.7 Gbytes of information on 
one side. A storage capacity of about 4.7 Gbytes is 
nearly 8 times the storage capacity of a conventional 
CD. Such a large storing capacity of DVDs can greatly 
improve the quality of moving pictures and also greatly 
extend the reproducible time period of moving pictures. 
Whereas the reproducible time period of a conventional 
video CD is 74 minutes, a DVD boasts a reproducible 
time period of 2 hours or more. 

[0045] The fundamental technology which realizes 
such a large storing capacity lies in reduction of the spot 
diameter d of a light beam. The spot diameter d is cal- 
culated from the formula: spot diameter d = laser wave- 
length a/numerical aperture NA of the object lens. 
Accordingly, the spot diameter d can be minimized by 
decreasing the laser wavelength X and increasing the 
numerical aperture NA of the object lens. It should be 
noted, however, that increasing the numerical aperture 
NA of the object lens can result in coma aberration due 
to the relative slant of the optical axis of the light beam 
with respect to the face of the disk (i.e.. lilt"). In the case 
of DVDs, the coma aberration is minimized by reducing 
the thickness of a transparent substrate. However, 
reducing the thickness of a transparent substrate can 
result in anotfier problem of lower mechanical Sffength 
of the disk. In the case of DVDs, the strength of the 
transparent sifcstrate is reinforced by attaching another 
substrate to the transparent substrate, thereby over- 
coming the problems associated with the mechanical 
strength of the disk. 

[0046] A red semiconductor laser having a short 
wavelength of 650 nm and an object lens having a large 
numerical aperture (NA) of about 0.6 mm are used to 
read the information stored on a DVD. By employing a 
thin transparent substrate having a thickness of about 
0.6 mm in addition to the above-mentioned laser and 
object lens, it is possible to store about 4.7 Gbytes of 
information on one side of an optical disk having a diam- 
eter of 120 mm. 

[0047] Figure 2A schematically illustrates a spiral 
track 20 formed from the inner periphery toward the 
outer periphery of the information layer 109 of the DVD 
107 (Figure 1 A). The spiral track 20 is divided into pre- 
scribed units referred to as sectors. In Figure 2A, the 
respective sectors are denoted by .81", S2, .... S99, and 
S100. The information stored on the DVD 107 is read on 
a sector-by-sector basis. 

[0048] Figure 2B shows an internal structure of a sec- 
tor, where the sector includes a sector header region 
21 \ a user data region 22, and an error correction code 
storage region 23. The sector header region 21 stores a 
sector address 



[0049] for identifying the sector and an error detection 
code therefor. Based on these sector addresses, a disk 
reproduction apparatus determines one of a plurality of 
sectors from which information should be read. 
5 [0050] The user data region 22 stores a 2 Kbyte long 
data string. 

[0051] The error correction code storage region 23 
stores error correction codes for the sector header 
region 21 and the user data region 22 included in the 
10 same sector. A disk reproduction apparatus performs 
error detection by using the error correction codes when 
reading data from the user data region 22 and performs 
error correction based on the error detection results, 
thereby insuring the reliability of data reading. 

15 

(2) Logical structure of the optical disk 

[0052] Figure 3 shows a logical structure of the DVD 
107 (Figure 1 A). As shown in Figure 3, the region of the 

20 DVD 107 is divided into a lead-in region 31, a volume 
region 32, and a lead-out region 33. These regions can 
be identified by identification information included in 
sector addresses of physical sectors. The physical sec- 
tors are arrayed in an ascending order based on their 

25 sector addresses. 

[0053] In the lead-in region 31 , data for stabilizing the 
operation of a reproduction apparatus at the beginning 
of read, for example, is stored. 

[0054] No meaningful data is stored in the lead-out 
30 region 33. The lead-out region 33 is used for informing 
the disk reproduction apparatus df the termination of 
reproduction. 

[0055] The volume region 32 stores digital data corre- 
sponding to an application. The physical sectors 

35 included in the volume region 32 are managed as logi- 
cal blocks. The logical blocks are identified by sequen- 
tial numbers (logical block numbers) respectively 
assigned to the physical sectors after the first physical 
sector (designated as the 0th physical sector) in the vol- 

40 ume region 32. 

[0056] As shown in Figure 3, the volume region 32 is 
subdivided into a volume file management region 32a 
and an audio zone region 32c. 

[0057] The volume file management region 32a stores 
45 file system management information for managing a 
plurality of logical blocks as files in accordance with 
IS013346. The file system management information is 
information indicating the correspondence between the 
name of each of a plurality of files and the addresses of 
so the logical blocks occupied by that file. A disk reproduc- 
tion apparatus achieves file-by-file access to the optical 
disk based on the file system management information. 
Specifically, the disk reproduction apparatus refers to 
the file system management information to obtain the 
55 addresses of the logical blocks corresponding to a given 
file name, and accesses the logical blocks based on 
these addresses. As a result, the digital data contained 
in a desired file can be read. 
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[0058] An audio zone region 32c stores an audio man- 
ager 900 and one or more audio title sets 800. 
[0059] The audio title set 800 contains a plurality of 
pieces of audio data and management information for 
managing the order of reproducing the plurality of 
pieces of audio data. An audio title set 800 has a data 
structure which allows for management of the audio 
data in units which are referred to as audio titles. Typi- 
cally* an audio title corresponds to a music album con- 
taining one or more tunes. 

(3) Data structure of the audio zone region 32c 

[0060] The audio zone region 32c stores an audio 
manager 900 and one or more audio title sets 800. 

(3.1) Data structure of the audio title set 800 

[0061] Figure 5 shows a data structure of an audio title 
set 800. The audio title set 800 includes a plurality of 
audio objects (hereinafter, referred to as "AOBs") 802, 
audio title set management information (ATSI) 801 for 
managing the order of reproducing the plurality of AOBs 
802, and audio title set management information 
backup (ATSIJBUP) 804 which is backup data of the 
audio title set management information 801. In the fol- 
lowing description, an "audio title set" will generally be 
referred to as an "ATS". 

(3.1 .1) Data structure of the AOB 802 

[0062] The AOB 802 is packetized into 2 Kbytes. The 
AOB 802 stores data in the LPCM format, AC3 format, 
DTS format or other compression formats. In the case 
of the LPCM format, the sample bit is 16, 20. or 24 bits, 
with a sampling frequency of 48 kHz, 96 kHz, 192 kHz, 
44.1 kHz, 88.2 kHz, or 176.4 kHz. 

(3.1 .2) Data structure of the audio title set management 
information 801 

[0063] The audio title set management information 
(ATSI) 801 includes information for managing the order 
of reproducing the AOBs 802. The order of reproducing 
the AOBs 802 is designated by a program chain (PGC), 
as in the case of video objects (VOBs). Different PGCs 
can define different orders of reproduction of AOBs 802. 
[0064] As shown in Figure 5, the audio title set man- 
agement information (ATSI) 801 includes an ATS man- 
agement table (ATSLMAT) 811 and a PGC 
management information table (ATS_PGCIT) 812. 
[0065] The ATS management table 811 serves as 
header information of the audio title set management 
information 801. The ATS management table 811 
includes a pointer indicating a region in which each 
AOB 802 is stored, a pointer indicating a region in which 
the PGC management information table 812 is stored, 
and attribute information on each AOB 802. In the case 
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where a still picture is stored on the DVD 107 (Figure 
1A), the ATS management table 811 also includes 
attribute information and the like for the still picture. 
[0066] The PGC management information table 
5 (ATS_PGCIT) 812 includes ATS PGC management 
table information (ATS_PGCITI) 831, a plurality of ATS 
PGC information search pointers (ATS_PGCLSRPs) 
832, and a plurality of PGC information (ATS_PGCI) 
units 833. 

w [0067] The ATS PGC information search pointers 
(ATS_PGCLSRPs) 832 are indices for a plurality of 
PGCs stored in the PGC management information table 
812 and designate the PGC information to be executed 
first on a title-by-title basis. 

is [0068] Each of the PGC information units describes 
positions on the optical disk 107 where one or more 
audio objects are recorded and the order of reproduc- 
tion thereof. Reproduction of the same audio object can 
be described by different PGC information. Specifically. 

20 each PGC information unit includes "ATS PGC general 
information (ATS_PGC_GI)". "ATS program information 
table (ATS_PGIT)", and "audio cell reproduction infor- 
mation table (ATS_C_PBIT)". 

[0069] Figure 7 shows a data structure of each PGC 

25 information unit. As shown in Figure 7, "ATS PGC gen- 
eral information (ATS_PGC_GI)" includes the number of 
programs and the number of cells included in the PGC 
information, the reproduction time period of the PGC. 
and information on pointers to "ATS program informa- 

ao tion table (ATS_PGIT)" and to "audio cell reproduction 
information table (ATS_C_PBIT)". 
[0070] The "ATS program information table 

(ATS_PGIT)" includes "program physical allocation 
information" indicating whether or not the AOB of each 

35 program and the AOB of the previous program are 
recorded at physically discontinuous positions on the 
optical disk, "program time attribute information" indicat- 
ing whether or not the time information of the AOB is 
continuous with the time information of the previous 

40 AOB, "program start cell number" indicating the first ceil 
number forming the program, "program still picture flag" 
indicating whether or not the program includes a still 
picture, "reproduction start audio cell time" indicating 
the first time information of the first audio cell included in 

45 the program, "program total reproduction time period" 
indicating the reproduction time period of the program, 
and "audio pause time period" indicating the silent 
period until the reproduction of the audio cell of the pro- 
gram starts. 

so [0071] The "audio cell reproduction information table 
(ATS_C_PBIT)" stores cell information forming the AOB 
to be reproduced. Specifically, the "ATS cell reproduc- 
tion information table" includes "cell index number" indi- 
cating the order of cells included in the program, "cell 

55 type" indicating the attribute of the cell (i.e.. whether the 
cell is a still picture cell, a silent cell or an audio cell), 
"cell start address" incficating the start address of the 
cell in the form of a relative address with respect to the 
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first pack of the AOB of the ATS including the cell, and 
"cell termination address" indicating the final address of 
the cell in the same manner. 

[0072] Figure 6 shows an exemplary structure of the 
AOB. The AOB is part of an MPEG2 stream and s 
includes a still picture cell including a still picture pack, a 
silent cell including an audio pack including substan- 
tially silent audio data, and an audio cell including an 
audio pack of audio data forming a tune. The AOB 
includes one or more audio cells and may not include a 10 
still picture cell or a silent cell. Still picture cells are 
never continuous with each other and silent cells are 
never continuous with each other. A still picture cell is 
always followed by a silent cell or an audio cell. A silent 
cell is always followed by an audio cell, in the example is 
shown in Figure 6, audio cells and silent cells exist and 
are physically continuous. Time information is also con- 
tinuous; i.e., the "program physical allocation informa- 
tion" has a value representing "continue", and the 
"program time attribute information" also has a value 20 
representing "continue". 

[0073] In Figure 6, the line passing through points B, 
C, E, F and G represents a change in the value of the 
time information (i.e., PTS) of the AOB. Point A repre- 
sents a value of the PTS of the still picture cell. Point B 25 
represents a value of the first PTS of a silent cell. Point 
C represents a value of the first PTS of an audio cell. 
Thus, the PTS of a silent cell is continuous to the PTS of 
an audio cell, and there is no data underflow gap, as 
referred to with respect to the MPEG2 stream, between 30 
the silent ceH andihe audio cefh Point B,- at which the 
value of the PTS of the silent cell is larger than the value 
of the PTS of the still picture cell, represents that a still 
picture in the still picture cell is displayed before the 
reproduction of the silent cell starts. The PTS of the next 35 
still picture cell is represented by point D, and the PTS 
of the next silent cell is represented by point E. In the 
case where the PTS of the still picture cell and the PTS 
of the silent cell have an equal value, the reproduction of 
the silent cell starts simultaneously when the still picture 40 
is displayed. The difference between the first PTS of the 
next audio cell at point F and the final PTS of the audio 
cell at point D is "audio pause time period". Since the 
gap between the PTS's needs to be 0.7 seconds or less 
as specified by MPEG2, the difference at point B 45 
between the first PTS of the silent cell and the PTS of 
the still picture cell, and the difference at point D 
between the first PTS of the still picture cell and the final 
PTS of the audio cell need to fulfill such a condition. 
[0074] Figure 9 shows an exemplary PGC forming a so 
title. The example shown in Figure 9 includes five pro- 
grams. Programs #1 and #2 correspond to AOB #1 , and 
programs #3, #4 and #5 correspond to AOB #2. On the 
information storing medium, AOB #1 is recorded after 
AOB #2. Programs #1 and #2 both include a still picture ss 
cell and a silent cell, and program #2 includes two audio 
ceils. Programs #3 and #4 includes only a silent cell, 
and program #5 includes only an audio cell. 



[0075] In the case where all the audio cells each have 
a reproduction time period of 60 seconds (5,400,000 in 
PTS), all the silent cells each have a reproduction time 
period of 1 second (90,000 in PTS), and the PTS of the 
still picture cell has the same value as that of the first 
PTS of the silent cell, program information can be 
described as shown in Figure 10. Under the conditions 
that the size of the still picture data is about 1 .88 Mbits 
and the audio data is 48 kHz, 16 bit sampling with 2 
channels, the number of the still picture packs is 112, 
the number of the silent cell packs is 96, and the 
number of the audio cell packs is 5760 as shown in Fig- 
ure 11- 

[0076] The audio title set has been described. Next, 
with reference to Figure 4, the audio manager will be 
described. 

(3.2) Data structure of the audio manager 900 

[0077] The audio manager 900 is information for con- 
trolling the reproduction which is to be first referred to for 
performing audio-oriented reproduction of the informa- 
tion on the optical disk by a reproduction apparatus. 
[0078] Figure 4 shows a data structure of the audio 
manager 900. 

[0079] The audio manager 900 includes "audio man- 
ager information (AMGI)". "VOB for audio manager 
menu (AMGM_VOBS)", and "audio manager informa- 
tion backup (AMGLBUP)". 

[0080] The "audio manager information (AMGI)" 
includes "audio manager information management 
table (AMGI_M AT)" including attribute information and 
pointer information, "audio title management informa- 
tion" indicating the number of audio titles and the like, 
"audio title search pointer (ATT_SRP)" indicating search 
information on the audio titles, and "audio manager 
menu PGC management information table 
(AMGM_PGCLUT)" indicating PGC information for 
audio manager menu. 

[0081] Each "audio title search pointer (ATT_SRP)" 
includes "audio title type" indicating the type of each 
title, "number of programs in the titles" indicating the 
number of programs included in the titles, "title repro- 
duction time period" indicating the reproduction time 
period of the titles, "ATS number" indicating the set 
number of the ATS to which each title belongs, "ATS title 
number" indicating the title number of each title in the 
ATS, and "ATS address" indicating the address of the 
ATS to which each title belongs. 
[0082] The audio zone region and the DVD as a mul- 
timedia optical disk have been described. Next, a repro- 
duction apparatus according to the present invention 
will be described. 

[0083] First an external appearance of a DVD player, 
which is a reproduction apparatus for multimedia optical 
disks will be described. Figure 19 shows external 
appearances of a DVD player 1, a TV monitor 2 and a 
remote controller 91. 
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[0084] The DVD player 1 has an opening in a front 
face of a chassis thereof, and includes a driving mecha- 
nism for loading an optical disk in the depth direction of 
the opening. 

[0085] Provided on the front face of the DVD player 1 s 
is a remote control receiving section 92 having a light 
receiving element for receiving infrared rays emitted by 
the remote controller 91. When an operator uses a 
remote controller held in his/her hand, the remote con- 
trol receiving section 92 sends an interrupt signal indi- 10 
eating that a key signal has been received. 
[0086] Provided on a rear face of the DVD player 1 are 
video and audio output terminals. A video signal repro- 
duced from the DVD can be output to the large TV mon- 
itor 2 for home use by connecting an AV cord to the is 
video and audio output terminals. Thus, the operator 
can enjoy the video reproduced from the DVD on a large 
TV of, for example, 33 inches or 35 inches. As can be 
appreciated from the above description, the DVD player 
1 in this example is not used in the state of being con- so 
nected to a personal computer but is used with the TV 
monitor 2 as an electronic appliance for home use. 
[0087] The remote controller 91 includes a spring- 
loaded keypad on a surface of a casing thereof, and out- 
puts a code corresponding to a pressed key by infrared 25 
rays. Figure 20 shows an operation panel of the remote 
controller 91. The "POWER" key on the operation panel 
is for turning the DVD player 1 ON or OFF. The "MENU" 
key is used for retrieving the volume menu of the optical 
disk during reproduction of the program chain. Ten keys so 
are used for, for example, chapter-jumping the movie or 
selecting music tunes. Upward, downward, leftward and 
rightward cursor keys are used for selecting items. The 
"ENTER" key is used for confirming an item selected by 
the cursor. When the cursor is moved on items by the 35 
upward, downward, leftward and rightward cursor keys, 
the item on which the cursor is positioned is displayed 
with a select color of item color information of the man- 
agement information pack. When the item is confirmed 
by the "ENTER" key. the item is displayed with a conf ir- 40 
mation color. In addition, there are keys which are com- 
mon with other AV apparatuses such as, for example, 
"PLAY**, "STOP". -PAUSE". "FORWARD" and 
"REWIND" keys. 

[0088] Next, the structure of the DVD player, which is 45 
a reproduction apparatus for multimedia optical disks 
will be described. 

[0089] Figure 8 is a block diagram of an internal struc- 
ture of a DVD player 70. The DVD player 70 includes a 
driving mechanism 81, an optical pickup 82, a mecha- so 
nism control section 83, a signal processing section 84, 
an AV decoder section 85, a remote control receiving 
section 92 and a system control section 93. 
[0090] The driving mechanism 81 includes a table on 
which an optical disk is set and a spindle motor for ss 
clamping and rotating the optical disk which is set on the 
table. The table is structured to be moved in and out of 
the chassis by an eject mechanism not shown. When 



the table is out of the chassis, the operator sets an opti- 
cal disk onto the table. When the optical disk is set on 
the table and the table is moved back into the chassis, 
the optical disk is loaded on the DVD player 70. 
[0091] The mechanism control section 83 controls a 
mechanical system including the motor for driving the 
disk and the optical pickup 82 for reading a signal 
recorded on the disk. Specifically, the mechanism con- 
trol section 83 adjusts the motor speed in accordance 
with the position of the track indicated by the system 
control section 93. The mechanism control section 83 
also controls an actuator of the optical pickup 82 to 
move the optical pickup 82. When an accurate track is 
detected by servo control, the mechanism control sec- 
tion 83 instructs the optical disk to wait until a desired 
physical sector on the rotating optical disk reaches the 
position of the optical pickup 82. Then, signals are con- 
tinuously read from the desired position. 
[0092] The signal processing section 84 processes 
the signal read from the optical pickup 82 with, for 
example, amplification, waveform shaping, binarization, 
demodulation, and error correction to convert the signal 
into a digital data stream, and then stores the resultant 
data stream in a buffer memory in the system control 
section 93 (described below) on a logical block-by-logi- 
cal block basis. 

[0093] The AV decoder section 85 processes the dig- 
ital data which is an input VOB, in a prescribed manner 
to convert the data into a video signal or an audio signal. 
Specifically, the AV decoder section 85 includes a sys- 
tem decoder 86, a video decoder 87 and an audio 
decoder 88. 

[0094] The system decoder 86 receives the digital 
data stream transferred from the buffer memory on a 
logical block-by-logical block basis (i.e., on a packet-by- 
packet basis) and distinguishes a stream ID and a sub 
stream ID in the header of each packet, thereby classi- 
fying the data into a moving picture data pack, an audio 
data pack and a management information pack. At the 
point, the moving picture data pack is output to the 
video decoder 87. Regarding the audio data pack, only 
an audio data pack having a designated stream number 
is output to the audio decoder 88 in accordance with a 
decode stream instruction command sent from the sys- 
tem control section 93. The management information 
pack is output to the system control section 93. The 
moving picture data pack input to the video decoder 87 
is extended by the prescribed format defined by the 
MPEG2 format and output as digital video data. Then, 
the digital video data is converted into a video signal of 
the NTSC format arid externally output. The audio data 
input to the audio decoder 88 is decoded in the LPCM or 
AC3 format in accordance with the data type. D/A con- 
verted, and then externally output as an audio signal. 
[0095] The audio decoder 88 processes the digital 
data, which is an input AOB, in a prescribed manner in 
accordance with the data type, converts the data into an 
audio signal, and externally outputs the data. 
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[0096] The system control section 93 includes a work- 
ing memory and a CPU integrated together and per- 
forms the overall control of the DVD player 70. 
[0097] The reproduction apparatus according to the 
present invention operates in, for example, the following s 
manner. 

[0098] When a DVD is loaded in the DVD player 70, 
the system control section 93 detects that a DVD is 
loaded from the optical sensor or the like. Then, the sys- 
tem control section 93 controls the mechanism control 10 
section 83 and the signal processing section 84, 
thereby controlling rotation of the DVD so as to cause 
the optical pickup 82 to seek to the lead-in region. Thus, 
the DVD player 70 is initialized and reproduction starts. 
[0099] For starting the reproduction, the system con- 15 
trol section 93 determines whether or not the DVD 
player 70 is in a video-oriented reproduction mode, by a 
reproduction mode determination section. When it is 
determined that the DVD player 70 is in the video-ori- 
ented reproduction mode, the system control section 93 so 
reads the video manager based on the information 
which is read from the volume file management region. 
The system control section 93 refers to the PGC man- 
agement information table for the video manager menu 
to calculate the recording address of the PGC for the 25 
volume menu. The resultant PGC is reproduced and 
retained inside. When the PGC for the volume menu is 
retained inside, the system control section 93 refers to 
the retained PGC information to calculate the video 
object (VOB) to be reproduced and the recording 30 
address of the VOB on the optical disk. When the VOB 
to be reproduced is determined, the system control sec- 
tion 93 outputs a control signal to the mechanism con- 
trol section 83 and the signal processing section 84 so 
as 4o retrieve the determined VOB from the DVD for 35 
reproduction. Thus, the video menu from which the user 
can select the title to be reproduced is displayed. 
[0100] The user sees the menu and selects and con- 
firms the title that he/she is interested in by designating 
the item number in the menu using the remote control- 40 
ler. Then, the system control section 93 receives the 
designation of the item number in the menu from the 
remote controller and refers to the management infor- 
mation pack included in the VOB of the video menu 
which is being reproduced, the VOB being input from 45 
the AV decoder section 85, to execute a control com- 
mand corresponding to the designated number. The 
control command is PlayTrtle #n or the like, and the title 
number to be reproduced is designated by "n". As an 
execution operation by the PlayTrtle command, the sys- so 
tern control section 93 refers to the title search pointer 
table, which is a part of the audio manager, to determine 
the audio title set (ATS) to which the designated title 
belongs to and the title number in the ATS. When the 
ATS is confirmed, the system control section 93 outputs ss 
a control signal to the mechanism control section 83 
and the signal processing section 84 so as to reproduce 
the ATS management information of the confirmed title 



set and retrieves the title search pointer table of the 
ATS, which is a part of the ATS management informa- 
tion, to the system control section 93. When the title 
search pointer table is retrieved, the system control sec- 
tion 93 refers to the table to determine the PGC informa- 
tion for starting the reproduction of the title to be 
reproduced. When the PGC information is determined, 
the system control section 93 outputs a control signal to 
the mechanism control section 83 and the signal 
processing section 84 so as to reproduce the deter- 
mined PGC information and retain the information in the 
inside buffer memory for the PGC information. When 
the PGC information is retained, the system control sec- 
tion 93 refers to the retained PGC information to deter- 
mine the audio object (AOB) to be reproduced and the 
recording address thereof. Then, the AOB is repro- 
duced by a control signal output to the mechanism con- 
trol section 83 and the signal processing section 84 
from the system control section 93. 
[0101] The system control section 93 sequentially 
determines the AOBs to be reproduced and controls the 
reproduction thereof in accordance with the retained 
PGC information. When the reproduction of the final 
AOB indicated by the PGC information is completed, the 
system control section 93 searches for the PGC infor- 
mation of the next title and reproduces the AOBs 
described in the PGC information in the same manner. 
Thus, all the titles are reproduced and the operation is 
stopped. In accordance with specific setting of the DVD 
player or the DVD. only one title can be reproduced 
before the operation is stopped, or a menu can be dis- 
played after reproduction of one or more titles is com- 
pleted. 

[0102] Next, a method for reproducing a still picture 
cell, a silent ceil and an audio cell will be described in 
detail. 

[0103] Figure 14 shows a conventional method for 
reproducing the audio cell. Conventionally, for reproduc- 
ing an MPEG2 stream, the leading end of the audio cell 
pack is first sought and data reading starts. However, 
audio output does not immediately start but starts after 
an idle time period, which is predetermined in each 
reproduction apparatus. The idle time period includes a 
time period required for determining the PTS of the 
audio data, a time period required for determining that 
data is correct, and a time period required for the muting 
circuit in an analog output section to be transitioned 
from a mute state to a non-mute state. Thus, the idle 
time period varies in accordance with the type of repro- 
duction apparatus. Especially when the reproductibn 
section and the decoder are accommodated in separate 
chassis, the idle time period tends to be extended 
because determination on the PTS and the determina- 
tion on the correctness of the audio data are separately 
performed. 

[01 04] Figures 1 2A and 1 2B show methods for repro- 
ducing only audio data without displaying a still picture 
even though the still picture is included. Figure 12A 
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shows the method in the case where the reproduction is 
performed from the leading end of the data, and Figure 
1 2B shows the method in the case where the reproduc- 
tion is performed in continuation from the previous cell. 
[0105] In the case where reproduction is performed s 
from the leading end of the data by selecting a title or 
program from the menu or the like through jumping, the 
leading pack of the silent cell is jumped to with reference 
to the start address of the ceil information. At this point, 
the STC, which is the reference time of the decoder is io 
set with the SCR of the leading pack of the silent cell. 
Next, skipping and jumping of data corresponding to a 
prescribed idle time period are performed with refer- 
ence to the PTS of the silent cell, and the silent cell for 
the idle time period is reproduced. Audio output is pre- is 
pared for during this period. At this point, the next title 
number is not displayed on the display device of the 
reproduction apparatus, and the passage of reproduc- 
tion time period is not updated. In the case where the 
audio output is started and display of the title number 20 
and the updating of the passage of the reproduction 
time period are performed simultaneously when the first 
PTS of the leading cell is detected, the reproduction 
apparatus behaves as if the audio data is output simul- 
taneously with the completion of the jump such as menu 25 
selection. In the case where the reproduction is per- 
formed in continuation from the previous cell, a still pic- 
ture pack is jumped over and a silent cell is jumped over 
or skipped except for the idle time period. At this point, 
by continuously counting the STC. which is reference for 30 
the system time, the interval between the completion of 
the audio output by the previous cell and the start of the 
audio output by the next cell equals the audio pause 
time period. The counting of the STC is continued when 
the physical allocation information and the time attribute 35 
information both have a value indicating "continue". 
When either one of them incficates "no-continue", the 
same processing as in the jump from the menu is per- 
formed, and thus the STC is reset at the leading end of 
the silent cell pack. Even in the case where there is no 40 
still picture ceil, the method is substantially the same as 
described above. In the case where there is no silent 
cell and the reproduction starts from the leading end of 
the data, the operation is similar to the conventional 
operation shown in Figure 14. In the case where the 45 
reproduction is performed in continuation from the pre- 
vious cell, the operation is as follows. When the time 
attribute information indicates "continue", the audio cell 
is decoded continuously from the previous cell and 
audio data is output. When the time attribute information so 
indicates "no-continue", the operation is the same as 
the conventional operation shown in Figure 14 as in the 
case where the reproduction is performed from the 
leading end of the data. 

[01 06] Figures 1 3 A and 1 3B show methods for repro- ss 
ducing the audio data while displaying a still picture. 
Figure 13A shows the method in the case where the 
reproduction is performed from the leading end of the 



data, and Figure 13B shows the method in the case 
where the reproduction is performed in continuation 
from the previous cell. 

[0107] In the case where the reproduction is per- 
formed from the leading end of the data or by selecting 
a title or program from the menu or the like through 
jumping, the pack of the still picture cell is sought from 
the cell information. Thus, the still picture cell is read 
and decoded. At this point, the STC, which is the refer- 
ence time of the decoder, is set with the SCR of the 
leading pack of the still picture cell. Next, the leading 
pack of the silent cell is read. When an STC reaches an 
STC of the still picture cell, the still picture is displayed. 
The rest of the operation is the same as the operation 
when there is no still picture. The display of the still pic- 
ture can be during the processing of the silent cell, 
simultaneously with the start of output of the audio cell, 
or after the start of output of the audio cell in the range 
permitted by the MPEG2 format. In the case where the 
reproduction is performed in continuation from the pre- 
vious cell, the still picture cell is read and decoded when 
a still picture pack is detected. Next, the silent cell is 
processed. Whether or not the STC is set depends on 
the time attribute information as in the case where there 
is no still picture cell. The rest of the operation is the 
same as the operation when no still picture is displayed, 
except that the still picture is displayed when the STC 
becomes a PTS of the still picture cell and that the STC 
is set with the SCR of the leading pack of the still picture 
cell. 

[0108] Figures 15 through 18 are flowcharts each 
illustrating a program. In the case where the physical 
allocation information of the program information shows 
"continue" and the program previously reproduced has 
a program number which is smaller by one than the pro- 
gram number of the program to be reproduced now, 
seek to the reading head for the DVD is not specifically 
necessary. When the time attribute information indi- 
cates "continue" and the reproduction is performed in 
continuation from the previous program, the STC, which 
is the reference time for the decoder, does not need to 
be reset. 

[01 09] The determination of whether or not the cell is 
a still picture cell can be performed using the cell type in 
the cell information or using a still picture flag of the pro- 
gram information. The determination of whether or not 
the cell is a silent cell is performed by the cell type in the 
cell information. The determination that the cell is the 
final cell is performed by comparing the termination 
address of the cell information and the address read 
from the data in the DVD. The completion of the repro- 
duction of the program can be determined by the cell 
index of the cell information returning to zero, the cell 
type indicating a still picture cell or a silent ceil, or the 
start cell number of the next program in the program 
information. 

[0110] The skipping of the silent cell is performed 
while detecting the PTS when the silent cell is decoded 
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or by obtaining the number of packs to be skipped 
based on the data rate. 

[0111] As described above, in the first example 
according to the present invention, program reproduc- 
tion information including the start time and reproduc- s 
tion time period of each of audio data based on the 
reproduction start time of the leading audio data in an 
MPEG2 stream is recorded in the management region 
as a part of the reproduction control information. 
Accordingly, a multimedia information storing medium 10 
which realizes reproduction of high quality digital audio 
data and video data added thereto under a restricted bit 
rate can be provided. The interval between the repro- 
duced audio data can be uniform even in low-cost repro- 
duction apparatuses or reproduction apparatuses is 
having no reproduction function for video data. Thus, 
title creators can prepare data easily. 



(Example 2) 



(2) J-ogical structure of the optical disk 

[01 14] TTie logical structure of the optical disk is iden- 
tical with that described in the first example and thus the 
description thereof will be omitted. 

(3) Data structure of the audio zone region 32c 
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[0112] An optical disk, and an apparatus and method 
for reproducing information from the optical disk in a 
second example according to the present invention will 
be described. Identical elements previously discussed 
in the first example bear identical reference numerals 25 
and the descriptions thereof will be omitted. 

(1) Physical structure of the optical disk 

[0113] The physical structure of the optical disk is 30 
identical with that described m the tirst example and 
thus the description thereof will be omitted. 
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[0115] The audio zone region 32c stores an audio 
manager 900 and one or more audio title sets 800 as in 
the first example. The data structure of the audio man- 
ager 900 and the data structure of the audio title sets 45 
800 are different from those in the first example. 

(3.1) Data structure of the audio title set 800 

[0116] Figure 23 shows a data structure of the audio so 
title set 800 in the second example. The audio title set 
800 includes a plurality of audio objects (hereinafter, 
referred to as "AOBs") 802, a plurality of picture video 
objects (hereinafter, referred to as "P_VOBs" 1002), 
audio title set management information (ATSI) 801 for 55 
managing the order of reproducing the plurality of AOBs 
802 and the plurality of P_VOBs 1002, and audio title 
set management information backup (ATSI_BUP) 804 



which is backup data of the audio title set management 
information 801. In the following description, an "audio 
title set" will generally be referred to as an "ATS". 

(3.1.1) Data structure of the AOB 802 

[01 17] The data structure of the AOB 802 of the opti- 
cal disk is identical with that described in the first exam- 
ple and thus the description thereof will be omitted. 

(3. 1 .2) Data structure of P_VOB 1 002 

[0118] Figure 31 shows a data structure of the P_VOB 
1002. The P_VOB 1002 includes a picture NV pack 
(P_NV_PCK) 1003, a plurality of video packs (V_PCK) 
1004, and a plurality of sub-picture packs (SP_PCK) 
1005. 

[01 19] Unlike in the case of DVD-Video, the P_VOB 
1002 includes one picture video object unit (P_VOBU), 
and only one picture NV pack (P_NV_PCK) 1003 is 
included. 

[0120] The sub-picture packs (SP_PCK) 1005 can 
each have a plurality of sub-picture streams and has an 
identification code as in the case of DVD-Video. The 
sub-picture packs (SP_PCK) 1005 can be omitted. 
[01 21 ] The P_VOB 1 002 does not include audio data, 
unlike in the case of DVD-Video. The picture NV pack 
(P_NV_PCK) 1003 includes a P__PCl packet and a 
P_DSI packet in a similar manner as a PCI packet and 
a P_DSI packet of DVD-Video. 

[0122] ^As shown in Figure 41 , the P_PCt fncTudes 
P_VOBU to which the P_PCI belongs to (i.e., attribute 
information, reproduction time information, highlight 
information and the like of P_VOB), as in the case of 
DVD-Video, but does not include address information. 
[01 23] As shown in Figure 42, the P_DSI includes an 
SCR of the NV_PCK to which the PJDSI belongs, and a 
termination address of the P_VOBU, i.e., P_VOB to 
which the P_DSI belongs, and address information of a 
pack including the final data of the first I picture of the 
video pack (P_PCK) as in the case of DVD-Video, but 
does not include other information. 

(3. 1 .3) Data structure of the audio title set management 
information 801 

[0124] The audio title set management information 
(ATSI) 801 includes information for managing the order 
of reproducing the AOBs 802 and the P_VOB 1 002. The 
order of reproducing the AOBs 802 is designated by a 
program chain (PGC), as in the case of video objects 
, (VOBs). 

[0125] The data structure of the audio title set man- 
agement information 801 is identical with that shown in 
Figure 5 except for the data structure of the PGC infor- 
mation 833. 

[0126] Each of the PGC information units describes 
positions on the optical disk where one or more audio 
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objects are recorded and the order of reproduction 
thereof. Reproduction of the same audio object can be 
described by a different PGC information unit. Specifi- 
cally, the PGC information includes "ATS PGC general 
information (ATS_PGC_GI)". "ATS program information s 
table (ATS_PGIT)\ "audio cell reproduction information 
table (ATS_C_PBIT)". "ATS picture program information 
table (ATS_PPGIT)", and "ATS picture cell reproduction 
information table (ATS_PC_PBny. 
[0127] Figure 25 shows a data structure of the PGC 10 
information. As shown in Figure 25, "ATS PGC general 
information (ATS_PGC_GI)" includes the number of pic- 
ture programs, the number of audio programs and the 
number of cells included in the PGC information; the 
reproduction time period of the PGC; and information is 
on pointers to a picture program reproduction mode, 
picture program reproduction control, the "ATS program 
information table (ATS_PGIT)Y "audio cell reproduction 
information table (ATS_C_PBIT)", "ATS picture program 
information table (ATS_PPGIT)". and "ATS picture cell so 
reproduction information table (ATS_PC_PBIT)". 
[0128] The "ATS program information table 

(ATS_PGIT)" includes a plurality "audio program infor- 
mation (ATS_PGI)" units included in the PGC. Each of 
the plurality of "audio program information (ATS_PGI)" 25 
units includes "audio program physical allocation info- 
ramtion" indicating whether or not the AOB of each pro- 
gram and the AOB of the previous program are 
recorded at physicaDy discontinuous positions on the 
optical disk, "audio program time attribute information" 30 
indicating whether or not the time information of the 
AOB is continuous with the time information of the pre- 
vious AOB, "audio program start cell number" indicating 
the f irst cell number forming the audio program, "repro- 
duction start audio cell time" indicating the first time 35 
information of the first audio cell included in the audio 
program, "audio program total reproduction time period" 
indicating the reproduction time period of the audio pro- 
gram, and "audio pause time period" indicating the 
silent period until the reproduction of the audio cell of 40 
the audio program starts. 

[0129] The "audio cell reproduction information table 
(ATS_C_PBIT)" stores cell information forming the AOB 
to be reproduced as in the first example. 
[0130] The "ATS picture program information table 45 
(ATS_PPGIT)" includes a plurality of "ATS picture pro- 
gram information" units. Each ATS picture program 
information unit includes a used SP number 
(PPG_SP_STN) describing a stream number of a sub- 
picture used in the picture program, an initial button so 
number (PPG_FOSL) indicating the number of a button 
which is in the highlight information used in the picture 
program and is selected in an initial state, a picture tran- 
sition mode (PPG_T_mode) indicating a transition 
mode of a still picture in the program, a still picture iden- ss 
trfication code (PPGJSRC_SPCT) indicating an ISRC 
code of the still picture in the program, a picture cell 
start PTS (PC_S_PTM) indicating a PTS of a first pack 



of the picture cell of the program, a picture program 
reproduction time period (ATS_PPG_PB_TM) which is 
a reproduction time period of the program, a start sec- 
tion transition time period (PPG_TI_TM) indicating the 
transition time period at the time of starting the repro- 
duction of the still picture of the program, and a termina- 
tion section transition time period indicating the 
transition time period at the time of terminating repro- 
duction of the still picture of the program. 
[0131] In this example, as the picture transition 
modes, "no transition", "fading from black", "fading into 
black", "cross-fading", and "wiping in several directions" 
are assumed. Each of the programs includes one cell. 
Since each cell has an independent P_VOB, the repro- 
duction time period of the picture program is a 1 -frame 
time period. 

[0132] The "ATS picture cell reproduction information 
table (ATS_PC_PBIT)" includes a cell still time period 
(P_C_Stili_Time) indicating a continuous time period 
during which the picture cell is displayed, a start 
address (P_C_FVOBLL_SA) of a cell of the picture pro- 
gram, and a termination address (P_C_LVOBU_SA) 
indicating an address of the final packet of the cell. 
[01 33] Figure 24 shows an exemplary structure of the 
AOB. The AOB is a part of an MPEG2 stream and 
includes a silent cell including an audio pack including 
substantially silent audio data and an audio cell includ- 
ing an audio pack of audio data forming a tune. The 
AOB includes one or more audio cells and may not 
include a silent cell. A silent cell is never followed by 
another silent cell and is always followed by an audio 
cell. In the example shown in Figure 24, audio cells and 
silent cells exist and are physically continuous. Time 
information is also continuous; i.e., the "audio program 
physical allocation information" has a value represent- 
ing "continue", and the "audio program time attribute 
information" also has a value representing "continue". 
[01 34] In Figure 24, the line passing through points B, 
C, E, F and G represents a change in the value of the 
time information (i.e., PTS) of the AOB. Point B repre- 
sents a value of the first PTS of a silent cell. Point C rep- 
resents a value of the first PTS of an audio cell. Thus, 
the PTS of a silent cell is continuous to the PTS of an 
audio cell, and there is no data underflow gap, as 
referred to with respect to the MPEG2 stream, between 
the silent cell and the audio cell. In a similar manner, the 
PTS of the next silent cell is represented by point E. The 
difference between the first PTS of the next audio cell at 
point F and the final PTS of the audio cell at point D is 
an "audio pause time period". Since the PTS's need to 
be continuous in one stream as specified by MPEG2. 
the change in the PTS is linear as shown in Figure 24. 
[0135] With reference to Figures 31, 41 and 42, the 
picture video object (P_VOB) will be described in detail. 
As described above, the leading pack of the P_VOB 
1 002 is a picture NV pack (P_NV_PCK) 1003 including 
a P_PCT packet and a P_DSI packet having time infor- 
mation, position information and highlight information. 
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Only one picture NV pack (PJMV_PCK) 1003 always 
* exists at the leading end thereof. The picture NV pack 
(P_NV_PCK) 1003 is followed by video packs (V_PCK) 
1004 which are MPEG2 video packs for storing video 
data and sub-picture packs (SP_PCK) 1005 for dividing 
sub-picture streams which are permitted to be included 
up to 32 streams and storing the divided sub-picture 
streams. In Figure 31, 2 sub-picture streams 1005 
(SP#1 and SP#2) are included and each sub-picture 
stream is divided into 2 packs. In Figure 31, the video 
packs (V_PCK) 1004 precede the sub-picture packs 
(SP_PCK) 1005, but there is no limitation regarding the 
order as long as the MPEG2 standards are fulfilled. 
Although termination codes such as program encodes 
or the like are not shown in Figure 31, such termination 
codes can be included as long as the requirements of 
MPEG2 streams are fulfilled. Such termination codes 
need not be included since the P_VOB 1002 is a part of 
the MPEG2 program stream. 

[01 36] Figure 41 shows a structure of the P_PCI infor- 
mation included in the P_PCI packet. The P_PCI infor- 
mation includes P_PCI general information, P_PCI 
highlight information, and P__PCI recording information. 
The P_PCI general information includes a P_VOBU 
type (P_VOBU_CAT) describing the type of the analog 
copy protection with respect to the P_VOBU video data. 
P_VOBU start time (P_VOBU_S_PTM) describing the 
PTS of the first vide?o to be displayed included in the 
P_VOBU, P_VOBU termination time 
(P_VOBU_E_PTM) describing the PTS of the final 
video to be displayed included in the P_VQBU, 
P_VOBU__SEQ time (P_VOBU_SEQ_E_PTM) describ- 
ing the PTS including the SEQ existing in the P_VOBU, 
and P_VOBU cell reproduction time 
(P_VOBU_C_ELTM) indicating relative time in a picture 
cell of the first video to be displayed of the P_VOBU. 
The P_VOB is always an I picture of 1 VOBU, 1 cell and 
1 video frame, and always includes SEQ. Accordingly, 
the P_VOBU_S_PTM, P_VOBU_E_PTM, 

P_VOBU_SE_E_PTM have an identical value, and the 
p_VOBU_E_ELTM indicates 0. 

[0137] The P_PCI highlight information includes 
P_PCI highlight general information (P_PCLHL_GI) 
describing the start time, termination time, number of 
buttons, the initial selection button number, forcible exe- 
cution button number and the like of the highlight infor- 
mation; a P_PCI button color information table 
(P_BTN_COUT) describing button color information; 
and a P_PCI button information table (P_BTN1T) 
describing the coordinate, magnitude, operation mode 
and the like of the button. Since the P_VOBU includes 
one video frame as described above, the P_PCl_HL_GI 
is described so that the start time of the highlight infor- 
mation indicates 0 and the termination time indicates 
infinity. 

[0138] The P_PCI recording information includes a 
P_PCI video ISRC code (P_ISRC_V) describing an 
ISRC code of the video of the P_VOBU; and a P_PCI 



sub-picture ISRC code (PJSRC_SP) describing an 
ISRC code of the sub-picture of the P__VOBU. 
[01 39] Figure 42 shows a structure of the PJDSI infor- 
mation described in the PJDSI packet. The P_DSI 

5 includes only the P_DSI general information 
(P_DSIJ3I). The P_DSI general information includes 
P_VOBU_NV pack time (PJMV_PCK_SCR) corre- 
sponding to the least significant 32 bits of the SCR of 
the NV pack of the PJ/OBU, a P_VOBU termination 

io address describing the relative address with respect to 
the NV pack of the final pack of the P^VOBU. and a 
P_VOBU first reference address 

(P_VOBLMSTREF_EA) describing the relative 
address with respect to the NV pack of the pack includ- 

75 ing the final data of the first I picture of the video of the 
P_VOBU. Since the P_VOBU includes 1 VOBU, the 
P_NV_PCK_SCR is always 0. 

[0140] Figure 29 shows the relationship among the 
PGC, audio program, cell, picture program, picture cell, 

20 AOB, and P_VOB. The entirety of Figure 29 represents 
one PGC. A plurality of such PGCs are included in one 
ATS. As described above, a PGC includes audio pro- 
grams describing the reproduction order by the audio 
data access unit and picture programs describing the 

25 reproduction order by the video data access unit. Each 
audio program has a cell, which is the minimum man- 
agement unit. The cell points to an AOB which is actual 
audio data and thus determines the reproduction data. 
In a similar manner, each picture program has a picture 

30 cell, which points to a P_VOB, which is actual video 
data, and determines the contents to be reproduced. 
[0141] Due to such a structure, actual reproduction 
data pieces can be recorded on the disk in a different 
order. Alternatively, the same data piece can be pointed 

35 by a plurality of cells or picture ceils. In order to guaran- 
tee the reproduction continuity, the audio data pieces 
has a flag which indicates whether the physical layout of 
the data on the disk is continuous or the data is a con- 
tinuous part of one AOB (i.e., audio program physical 

40 allocation information, audio program time attribute 
information). The video data is fundamentally a still pic- 
ture. Accordingly, one video data program includes one 
ceil, and one cell includes one P_VOB. The physical 
arrangement of the P_VOBs causes no specific prob- 

45 lems. Thus, the video data does not have such a flag. 
[0142] Figure 30 shows an exemplary PGC forming a 
title. The example shown in Figure 30 includes five pro- 
grams. Programs #1 and #2 correspond to AOB #1 , and 
programs #3, #4 and #5 correspond to AOB #2. On the 

so recording medium, AOB #1 is recorded after AOB #2. 
Programs #1 and #2 both include a silent cell and a 
silent cell, and program #2 includes two audio cells. Pro- 
grams #3 and #4 include a silent cell, and program #5 
includes only an audio cell. 

55 [01 43] in the case where all the audio cells each have 
a reproduction time period ol 60 seconds (5,400,000 in 
PTS) and all the still picture cells each have a reproduc- 
tion time period of 1 second (90,000 in PTS), audio pro- 
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gram information can be described as shown in Figure 
32. Picture program information can be described as 
shown in Figure 33. Under the conditions that the audio 
data is 48 kHz, 16 bit sampling with 2 channels, the 
number of the silent packs is 96 and the number of the 5 
audio cell packs is 5760 as shown in Figure 34. Where 
the number of still picture packs is 50, the picture cell 
information is as shown in Figure 35. In such a case, it 
is assumed that the P__VOB used in the picture cell is 
recorded at a position following an AOB. The recording w 
positions for the P_VOB and AOB are not physically lim- 
ited, and the P_VOB and AOB can be positioned in a 
mixed state with no specific problem. In such a case, the 
physical allocation of the audio program information of 
no-Continue indicates that the AOB indicated by the 75 
audio program is not continuously reproducible. The 
audio cell and the picture cell both share the AOBs and 
P_VOBs used by the other celjs. 
[0144] The audio title set has been described. Next, 
with reference to Figure 22, the audio manager will be 20 
described. 

(3.2) Data structure of the audio manager 900 

[0145] The audio manager 900 is information for con- 25 
trolling the reproduction which is to be first referred to for 
performing audio-oriented reproduction of the informa- 
tion on the optical disk by a reproduction apparatus. 
[0146] Figure 22 shows a data structure of the audio 
manager 900. 30 
[0147] The audio manager 900 includes "audio man- 
ager information (AMGI)", "VOB for audio manager 
menu (AMGM_VOBS)", and "audio manager informa- 
tion backup (AMGLBUP)" as in the first example. 
[0146] The "audio manager information (AMGI)" 35 
includes "audio manager information management 
table (AMGI_MAT)" including attribute information and 
pointer information, "audio title management informa- 
tion" indicating the number of audio titles and the like, 
"audio title search pointer (ATT_SRP)" indicating search 40 
information on the audio titles, and "audio manager 
menu PGC management information table 
(AMGM_PGCI_UT)" indicating PGC information for 
audio manager menu, as in the first example. 
[0149] The "audio manager information management 45 
table (AMGl_MAT)" includes an AMG identifier 
(AMG_ID) for identifying an audio manager, an AMG 
termination address (AMG_EA) which is a termination 
address for table management, an AMGI termination 
address (AMGI_EA), an AMGLMAT termination so 
address ( AMGI_M AT_EA) , address information on the 
AMG and AMGI tables, version number (VERN), a vol- 
ume set identifier (VLMSJD), a number of title sets 
(TS_Ns) indicating the number of the title sets included 
in the volume, a provider identifier (PRVJD) for identify- ss 
ing the provider which produced the disk, an automatic 
execution flag (Auto_Play_Flag) defining the operation 
of the player when the disk is inserted, and 



AMGM_VOBS indicating VOB attribute information 
existing in the audio manager. 

[0150] The "audio title search pointer table 
(ATT_SRP)" has an identical structure with that in the 
first example. 

[0151] The audio zone region and the DVD as a mul- 
timedia optical disk have been described. Next, a repro- 
duction apparatus, according to the present invention, 
for reproducing a media having the above-described 
structure will be described. 

[0152] The basic structure of the reproduction appa- 
ratus is similar to the structure described in the first 
example. In the second example, a key for separately 
controlling audio data and video data is provided in a 
remote controller for operating the DVD player or a front 
panel of the DVD player. With such a key, the video data 
can be skipped forward and backward or rewound to the 
start of the video data while continuously reproducing 
the audio data. 

[0153] Figure 26 is a block diagram illustrating an 
internal structure of a DVD player 80 in the second 
example. The DVD player 80 is different from the DVD 
player 70 shown in Figure 8 in the structure of the AV 
decoder section, which will be described below. 
[01 54] In the second example, there are two streams, 
which are an AOB as audio data and P_VOB as video 
data. An AV decoder section 85a includes two decoders 
respectively corresponding to the AOB and P_VOB. 
These decoders are operated synchronously with or 
asynchronously from each other. Specifically, the AV 
decoder section 85a includes a P_VOB system decoder 
103 for receiving a P_VOB stream and distinguishing a 
stream ID and sub-stream ID in a header of each packet 
included in the stream, thereby classifying the data into 
a video packet, a PCI packet, and a sub-picture packet. 
The AV decoder section 85a further includes an AOB 
system decoder 104 for receiving an AOB stream and 
classifying the data on a packet header-by-packet 
header basis. The AV decoder section 85a still further 
includes a highlight buffer 94 for temporarily storing 
highlight information of a PCI packet from the P_VOB 
system decoder 103, a video buffer 96 for temporarily 
storing a video packet in a similar manner, a sub-picture 
buffer 105 for temporarily storing a sub-picture packet, 
an audio buffer 99 for temporarily storing an audio 
packet from the AOB system decoder 104, a highlight 
decoder 95 for decoding the highlight information from 
the highlight buffer 94 and outputting the decoded high- 
light information to the system control section 93, a 
video decoder 87 for decoding the video data from the 
video buffer 96, a sub-picture decoder 98 for decoding 
the sub-picture data from the sub-picture buffer 1 05, an 
audio decoder 100 for decoding the audio data from the 
audio buffer 99 and outputting the decoded audio data 
as an audio output, a video synthesizer 101 for synthe- 
sizing the decoding results of the video decoder 87 and 
the sub-picture decoder 98 into one piece of video data, 
and a synchronization adjusting section 102 for manag- 
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ing the synchronization of the highlight decoder 95, the 
video decoder 87, the sub-picture decoder 98 and the 
audio decoder 1 00. 

[0155] The reproduction apparatus according to the 
present invention operates in, for example, the following s 
manner. The operation up to the reading of the stream 
is the same as described in the first example and will not 
be described below. 

[0156] For starting the reproduction, the system con- 
trol section 93 determines whether or not the DVD io 
player 80 is in a video-oriented reproduction mode, by a 
reproduction mode determination section. When it is 
determined that the DVD player 80 is in the video-ori- 
ented reproduction mode, the system control section 93 
reads the video manager based on the information is 
which is read from the volume file management region. 
The system control section 93 refers to the PGC man- 
agement information table for the video manager menu 
to calculate the recording address of the PGC for the 
volume menu. The resultant PGC is reproduced and so 
retained inside. When the PGC for the volume menu is 
retained inside, the system control section 93 refers to 
the retained PGC information to calculate the video 
object (VOB) to be reproduced and the recording 
address of the VOB on the optical disk When the VOB 25 
to be reproduced is determined, the system control sec- 
tion 93 outputs a control signal to the mechanism con- 
trol section 83 and the signal processing section 84 so 
as to retrieve the determined VOB from the optical disk 
for reproduction. Thus, the video menu from which the 30 
user cart Select the title to be reproduced is displayed 
(see Figure 40). 

[0157] The user sees the menu and selects and con- 
firms the title that he/she is interested in by designating 
the item number in the menu using the remote control- 35 
ler. Then, the system control section 93 receives the 
designation of the item number in the menu from the 
remote controller and refers to the management infor- 
mation pack included in the VOB of the video menu 
which is being reproduced, the VOB being input from 40 
the AV decoder section 85a, to execute a control com- 
mand corresponding to the designated number. The 
control command is PlayTrtle #n or the like, and the title 
number to be reproduced is designated by "n". As an 
execution operation by the PlayTitle command, the sys- 45 
tern control section 93 refers to the title search pointer 
table, which is a part of the audio manager, to determine 
the audio title set (ATS) to which the designated title 
belongs to and the title number in the ATS. When the 
ATS is confirmed, the system control section 93 outputs so 
a control signal to the mechanism control section 83 
and the signal processing section 84 so as to reproduce 
the ATS management information of the confirmed title 
set and retrieves the title search pointer table of the 
ATS. which is a part of the ATS management informa- 55 
tion. to the system control section 93. When the title 
search pointer table is retrieved, the system control sec- 
tion 93 refers to the table to determine the PGC informa- 



tion for starting the reproduction of the title to be 
reproduced. When the PGC information is determined, 
the system control section 93 outputs a control signal to 
the mechanism control section 83 and the signal 
processing section 84 so as to reproduce the deter- 
mined PGC information and retain the information in the 
inside buffer memory for the PGC information. When 
the PGC information is retained, the system control sec- 
tion 93 first refers to the ATS picture program informa- 
tion table, with reference to the retained PGC 
information, to read all the P_VOBs described in the 
PGC information and inputs the P_VOBs to the AV 
decoder section 85a. The AV decoder section 85a sep- 
arates the input P_VOBs on a pack-by-pack basis or 
packet-by-packet basis by the P_VOB system decoder 
103 and stores the P_VOBs in corresponding buffer 
memories. 

[01 58] At this point, the highlight decoder 95, the video 
decoder 87, and the sub-picture decoder 98 do not per- 
form decoding operations. When all the P_VOBs are 
stored in the corresponding buffer memories, the AOB 
to be reproduced and the recording address thereof are 
determined by the ATS program information table and 
the ATS cell reproduction information table. Then, the 
AOB is reproduced by a control signal output to the 
mechanism control section 83 and the signal process- 
ing section 84 from the system control section 93. At 
this point, the highlight decoder 95, the video decoder 
87, and the sub-picture decoder 98 perform decoding 
operations for the first time to start video output, high- 
tight processing and audio output. The duQ>ut timing fe 
adjusting by the synchronization adjusting section 102 
in accordance with the audio program information, pic- 
ture program information, highlight information and time 
code information (PTS) of each packet. 
[0159] In the second example, data for the corre- 
sponding picture programs is all stored in the corre- 
sponding buffers before the audio data is reproduced. 
Accordingly, the highlight buffer 94, the video buffer 96, 
and the sub-picture buffer 1 05 in the DVD player 80 in 
the second example require a larger buffer memory 
capacity than the capacity specified by MPEG2 or DVD- 
Video. 

[0160] Thereafter, the system control section 93 
sequentially determines the AOBs to be reproduced 
and controls the reproduction thereof in accordance 
with the retained PGC information, and also decodes 
the highlight information, sub-picture information and 
video information which are already read in the buffer 
memories. When the reproduction of the final AOB indi- 
cated by the PGC information is completed, the system 
control section 93 searches for the PGC information of 
the next title and reproduces the P_VOBs and the AOBs 
described in the PGC information in the same manner. 
[0161] Thus, the all the titles are reproduced and the 
operation is stopped. In accordance with specific setting 
of the DVD player or the DVD, only one title can be 
reproduced before the operation is stopped, or a menu 
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can be displayed after reproduction of one or more titles 
is completed. 

[0162] Next, a method for reproducing a still picture 
cell, a silent cell and an audio cell will be described in 
detail. 5 
[0163] Figure 14 shows a conventional method for 
reproducing the audio cell. Conventionally, for reproduc- 
ing an MPEG2 stream, the leading end of the audio cell 
pack is first sought by the optical head and data reading 
starts. However, audio output does not immediately 10 
start but starts after an idle time period, which is prede- 
termined in each reproduction apparatus. The idle time 
period includes a time period required for determining 
the PTS of the audio data, a time period required for 
determining that data is correct, and a time period is 
required for the muting circuit in an analog output sec- 
tion is to be transitioned from a mute state to a non- 
mute state. Thus, the idle time period varies in accord- 
ance with the type of reproduction apparatus. Especially 
when the reproduction section and the decoder are 20 
accommodated in separate chassis, the idle time period 
tends to be extended because determination on the 
PTS and the determination on the correctness of the 
audio data are separately performed. 
[01 64] Figures 36 A and 36B show methods for repro- 25 
ducing a silent cell and an audio cell. Figure 36 A shows 
the method in the case where the reproduction is per- 
formed from the leading end of the data, and Figure 36B 
shows the method in the case where the reproduction is 
performed in continuation from the previous cell. 30 
[0165] In the case where reproduction is performed 
from the leading end of the data by selecting a title or 
program from the menu or the like through jumping, the 
leading pack of the silent cell is jumped to with reference 
to the start address of the cell information. At this point, 35 
the STC, which is the reference time of the decoder, is 
set with the SCR of the leading pack of the silent cell. 
Next, skipping and jumping of data corresponding to a 
prescribed idle time period are performed with refer- 
ence to the PTS of the silent cell, and the silent cell for 40 
the idle time period is reproduced. Audio output is pre- 
pared for during this period. At this point, the next title 
number is not displayed on the display device of the 
reproduction apparatus, and the passage of reproduc- 
tion time period is not updated. In the case where the 45 
audio output is started and display of the title number 
and the updating of the passage of the reproduction 
time period are performed simultaneously when the first 
PTS of the leading cell is detected, the reproduction 
apparatus behaves as rf the audio data is output simul- so 
taneously with the completion of the jump such as menu 
selection. 

[0166] In the case where the reproduction is per- 
formed in continuation from the previous cell, a silent 
cell is jumped over or skipped except for the idle time ss 
period. At this point, by continuously counting the STC, 
which is reference for the system time, the interval 
between the completion of the audio output by the pre- 



vious cell and the start of the audio output by the next 
cell equals the audio pause time period. The counting of 
the STC is continued when the physical allocation infor- 
mation and the time attribute information both have a 
value indicating "continue". When either one of them 
indicates "no-continue", the same processing as in the 
jump from the menu is performed, and thus the STC is 
reset at the leading end of the silent cell pack. In the 
case where there is no silent cell and the reproduction 
starts from the leading end of the data, the operation is 
similar to the conventional operation shown in Figure 
14. In the case where the reproduction is performed in 
continuation from the previous cell, the operation is as 
follows. When the time attribute information indicates 
"continue", the audio cell is decoded continuously from 
the previous cell and audio data is output. When the 
time attribute information indicates "no-continue", the 
operation is the same as the conventional operation 
shown in Figure 14 as in the case where the reproduc- 
tion is performed from the leading end of the data. 
[0167] Figures 37 through 39 are flowcharts each 
illustrating a program. Since information on the P_VOBs 
is read in the corresponding buffer memories, reproduc- 
tion of the audio data and processing of the highlight 
information can be performed asynchronously from 
each other, and the still picture and the sub-picture can 
be output asynchronously from each other. Needless to 
say. synchronous display is possible based on the time 
code or the time information of the system. 
[01 68] Regarding reproduction of audio information, in 
the case where the physical allocation information of the 
audio program information shows "continue" and the 
program previously reproduced has a program number 
which is smaller by one than the program number of the 
program to be reproduced now- seek to the reading 
head for the DVD is not specifically necessary. When 
the time attribute information indicates "continue" and 
the reproduction is performed in continuation from the 
previous program, the STC, which is the reference time 
for the decoder, does not need to be reset. 
[01 69] The determination of whether or not the cell is 
a silent cell is performed using the cell type in the cell 
information. The determination that the cell is the final 
cell is performed by comparing the termination address 
of the cell information and the address read from the 
data in the DVD. The completion of the reproduction of 
the audio program can be determined by the cell index 
of the cell information returning to zero, the cell type 
indicating a still picture cell or a silent cell, or the start 
cell number of the next program in the audio program 
information. 

[0170] The skipping of the silent cell is performed 
while detecting the PTS when the silent cell is decoded 
or by obtaining the number of packs to be skipped 
based on the data rate. 

[01 71 ] Since data of a still picture is already stored in 
the buffer memory, the display of the still picture is man- 
aged based on the position in the buffer memory at 
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which the still picture of the designated cell of the desig- 
nated program is recorded or based on the ordinal 
number (i.e.. first second, etc.) of the still picture with 
respect to the start. When the management is per- 
formed based on the position in the buffer memory, the 
data is sent to the video decoder 87 from the designated 
address in the video buffer 96 and displayed. When the 
management is performed based on the ordinal number 
of the still picture, the data stored in the video buffer 96 
from the leading end thereof is sent to the video 
decoder 87 and the display is switched when the desig- 
nated ordinal number is reached. The transmission of 
the data is stopped at this point. The display time period 
of the still picture is in accordance with the description in 
the picture cell still time (P_C_Still_Time). Since still pic- 
tures are each 1 -frame data, the present still picture is 
switched into the next still picture when the time period 
described in the picture cell still time has passed. Such 
control is usually performed by the system control sec- 
tion 93. A picture cell still time of FFh indicates infinity, 
which means that the still picture is switched basically 
only by a command, user operations, or completion of 
the audio program. When the still picture is switched, 
display effects can be added by designating the transi- 
tion mode. The time to switch the still picture can be 
designated on a program-by-program basis before and 
after the display of the still picture. However, in the case 
where the reproduction apparatus does not have a spe- 
cial effect display function, the still picture can simply be 
switched with the display effects being ignored. 
[0172] In a synchronous mode, the audio data is 
reproduced based on the audio program reproduction 
time period and the time code information of the audio 
data, and the still picture and the sub-picture are repro- 
duced based on the picture program reproduction time 
period. The synchronisation adjusting section 102 com- 
pares the reference time information of the system and 
the reproduction time periods and thus performs syn- 
chronization control. 

[0173] In an asynchronous mode, audio data is not 
synchronized with and reproduced separately from 
highlight data, sub-picture data and still picture data. 
The highlight data, sub-picture data and still picture data 
are synchronized to one another, and the audio data is 
continuously reproduced realtime based on the time 
information of the system. The still picture is switched in 
the asynchronous mode by the user operations and the 
command. The user operations basically include an 
operation of continuously reproducing audio programs 
while feeding picture programs forward, backward or to 
the leading end; and an operation of feeding both audio 
programs and picture programs simultaneously forward, 
backward or to the leading end. The command has sim- 
ilar functions. 

[01 74] The synchronous mode and the asynchronous 
mode are switched over in accordance with the picture 
program reproduction control. In the case where the pic- 
ture program reproduction control is designated so as to 



prohibit control by the user operations, the data is basi- 
cally reproduced in the synchronous mode except for by 
the command. 

[01 75] In the case where the picture program repro- 

5 duction mode prevents the user operations and com- 
mand from synchronizing audio programs and picture 
programs with each other, the DVD player can wait until 
the synchronisation is obtained or the picture programs 
can be continuously reproduced in accordance with the 

10 picture program reproduction time period without syn- 
chronization. In the case where the reproduction of the 
audio programs is not completed when the reproduction 
of the picture programs is completed, the picture pro- 
grams can be continuously reproduced from the first 

15 picture program or the final picture program can be con- 
tinuously reproduced. In the case where the reproduc- 
tion of all the picture programs are completed before all 
the audio programs are completed, only the picture pro- 
grams can be reproduced with the audio part being 

20 silent, the audio programs can be reproduced from the 
first audio programs, or the reproduction of the audio 
title can be terminated. Some of the audio programs 
and the picture programs included in the audio title can 
be randomly selected and repeated a designated 

25 number of times. Such audio and picture programs can 
be repeated so that no same program is repeated (shuf- 
fle mode). The audio programs and the picture pro- 
grams are synchronized to each other so that the audio 
program and the picture program having the same 

30 number can be always reproduced simultaneously, or 
the audio programs and the picture programs can be 
reproduced independently and asynchronously. 
[0176] An audio title number or time can be directly 
designated for reproduction by user operations, i.e., 

35 usually by a remote controller or other keys. Such repro- 
duction by direct designation by user operations cannot 
be used when the directly designated reproduction pro- 
hibition flag (UOP1) and the time designated reproduc- 
tion prohibition flag (UOP2) of the ATT_CAT indicate 

40 prohibition. By such a designation, the title creators can 
describe reproducible audio titles by a command under 
some conditions of the reproduction apparatus. For 
example, an audio title which is not usually reproducible 
can be reproduced under certain conditions. For exam- 

45 pie, a certain parameter can be set in a specific players 
so that the parameter can be evaluated by a command, 
and the command can be described so as to reproduce 
the audio title only when a value is set in the parameter. 
When UOP 1 and UOP2 are set so as to prohibit directly 

so designated reproduction, the audio title can be repro- 
duced only by the specific player. Such a system easily 
allows reproduction of certain audio titles, recorded on a 
disk including a plurality of audio titles, for which fee has 
been paid. 

55 [0177] AV decoder sections having other structures 
are usable. For example, as shown in Figure 27, an AV 
decoder section 85b also acts as a system decoder for 
AOB and P_VOB. Since the AOB and P_VOB are input 
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at different timing, the same type of AV decoder section 
as used by DVD-Video is usable except that the capac- 
ities of the highlight buffer 94, the video buffer 96, and 
the sub-picture buffer 105 are increased and that a still 
picture management mechanism is added to the control 5 
of the video buffer 96. The operation of the AV decoder 
section 85b is substantially the same as that of the AV 
decoder section 85a. 

[0178] The DVD player can have the structure shown 
in Figure 28. In this structure, a P_VOB buffer 106 for 10 
P_VOB streams is provided immediately before the AV 
decoder section 85b. The same type of AV decoder 
section as used by DVD-Video is usable. All the P_VOB 
data to be read before the reproduction of the audio 
data is stored in the P_VOB buffer 106. The P_VOB 75 
buffer 106 dynamically multiplexes the P_VOBs corre- 
sponding to the reproduction order with the audio 
stream and sends the resultant data to the AV decoder 
section 85b. Accordingly, the system decoder 120 for 
AOB and P_VOB in the AV decoder section 85b is 20 
required to have a processing speed which is slightly 
higher than the speed specified by the DVD-Video 
standards. Streams need to be supplied to the AV 
decoder section 85b so that the audio buffer 99 does 
not underflow. 25 
[0179] In the second example, as described above, 
program reproduction information including the start 
time and reproduction time period of each piece of 
audio data based on the reproduction start time of the 
leading audio data in the MPEG2 stream is recorded in 30 
the management region as a part of the reproduction 
control information. Accordingly, an optical disk for real- 
izing reproduction of high quality digital audio data 
along with the video data in a restricted range of bit 
rates is provided. Moreover, a certain interval between 35 
audio reproductions can be maintained even in an inex- 
pensive reproduction apparatus including no video data 
reproduction function. This allows title creators to create 
data easily. 

[0180] Since a plurality of still pictures, sub-picture 40 
and menu can be displayed synchronously with or asyn- 
chronously from high quality audio data, a wider variety 
of displays become available. 

(Example 3) 45 

[0181] An optical disk, and an apparatus and method 
for reproducing information from the optical disk in a 
third example accorcfing to the present invention will be 
described. Identical elements previously discussed in so 
the first example bear identical reference numerals and 
the descriptions thereof will be omitted. 

(1) Physical structure of the optical disk 

55 

[0182] The physical structure of the optical disk is 
identical with that described in the first and second 
examples and thus the description thereof will be omit- 



ted. 

(2) Logical structure of the optical disk 

[01 83] The logical structure of the optical disk is iden- 
tical with that described in the first and second exam- 
ples and thus the description thereof will be omitted. 

(3) Data structure of the audio zone region 32c 

[0184] Figure 43 shows a data structure of the audio 
zone region 32c. The audio zone region 32c stores an 
audio manager 900 and one or more audio title sets 800 
as in the first and second examples. The audio zone 
region 32c further stores an audio still video set 1100. 

(3.1) Data structure of the audio manager 900 

[0185] Figure 44 shows a data structure of the audio 
manager 900 in the third example. The structure of the 
audio manager 900 has the same structure as that 
shown in Figure 4 down to the stage of the audio man- 
ager information management table (AMGLMAT) 902. 
[0186] The audio manager information management 
table 902 includes an ASVS start address (ASVS_SA) 
904 for storing the position of the audio still video set. 
[01 87] The structure of the audio manager information 
management table 902 is the same as that shown in 
Figure 22 except for the ASVS start address 904. 

(3.2) Data structure of the audio still video set 1 100 

[01 88] Figure 45 shows a data structure of the audio 
still video set 1100. The audio still video set 1100 
includes a plurality of picture video objects (hereinafter, 
referred to as "P^VOBs") 1102, audio still video set 
management information (ASVSI) 1101 for managing 
the plurality of P_VOBs 1102, and audio still video set 
management information backup (ASVSI_BUP) 1103. 
In the following description, the audio still video set will 
generally be referred to as an "ASVS". The P_VOBs 
1 102 will generally be referred to as an "audio still video 
object (ASVOB)". 

(3.2.1) Data structure of the P_VOB 1102 

[01 89] Figure 46 shows a data structure of the P_VOB 
1102. The P_VOB 1102 includes a picture NV pack 
(P_NV_PCK) 1113, a plurality of video packs (V_PCK) 

1114, and a plurality of sub-picture packs (SP_PCK) 

1115. Only one picture NV pack 1113 is included in 
each P_VOB 1102. 

[0190] The video packs (V_PCK) 1114 each stores 
video data forming an MPEG-system I picture. Unlike 
the VOB, the P_VOB is for storing a still picture but not 
a moving picture. Accordingly, the video data stored in 
each video pack 1114 includes only an I picture which is 
in-frame-compressed. Herein, information indicating 
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one still picture corresponds to one I picture. 
[0191 ] The sub-picture packs 1115 can each store a 
plurality of sub-picture streams. Each sub-picture packs 
1115 stores an identification code for identifying the plu- 
rality of sub-picture streams. The sub-picture packs s 
1115 need not be included in the P_VOB 1 1 02. 
[0192] The picture NV pack 1113 includes a DSI 
packet (not shown) and a PCI packet (not shown). The 
DSI packet stores information used for special effects 
such as fast-forward reproduction. The PGI packet 10 
stores highlight information 1 120 (Figure 47). 
[0193] As shown in Figure 47, the highlight informa- 
tion 1120 includes ASV highlight general information 
1121 indicating an effective period, the number of but- 
tons, and the like of highlight; ASV button color informa- 75 
tion 1122 indicating the display color of the button; and 
an ASV button information table 1123 indicating a but- 
ton command and the like. 

[0194] Unlike in the case of DVD-Video, the P_VOB 
1 1 02 does not have any audio data. 20 

(3.2.2) Data structure of the audio still video set man- 
agement information 1101 

[01 95] As shown in Figure 45, the audio still video set 25 
management information 1101 includes audio still video 
unit information (ASVUI) 1131 for managing a plurality 
of still pictures as a group and an audio still video 
search pointer table (ASV-SRPT) 1132 for managing 
the recording position of the P_VOB 1 102. 30 
[0196] The audio still video unit information 1131 it 
information for managing a plurality of P_VOBs 1102 
having the same attribute and continuously recorded on 
the disk. Such information is a basic unit for reproduc- 
tion processing. 35 
[0197] The audio still video unit information 1131 
includes an "audio still video set ID (ASVSJD)"; a 
"number of audio still video sets (ASVSJMs)"; a 
"P_VOB start address (P_VOBS_SA)" and a "P_VOB 
termination address (P_VOBS_EA)" indicating the 40 
recording position of the P_VOB; an "ASVU attribute" 
indicating the ASVU attribute such as, for example, a 
compression method of the P_VOB, aspect mode and 
existence of a button; a "P_VOB sub-picture palette 
(P_VOBS_SP_PLT)" for determining the display color of 45 
the sub-picture; and "ASVU general information 
(ASVU_GI)" corresponding to each ASVU for designat- 
ing the P_VOB belonging to each ASVU. 
[0198] Each "ASVU general information (ASVUJ3I)" 
includes a "number of P__VOB (P_VOB_Ns)" indicating so 
the number of P_VOBs forming the ASVU, and a "start 
P_VOB number" indicating which P_VOBs belong to the 
ASVU. 

[01 99] The audio still video search pointer table 1 1 32 
stores a plurality of "ASV search pointer (ASV_SRPT)" ss 
indicating the start address of each of the P_VOBs 
included in the ASVS. 



(3.3) Data structure of the audio title set 800 

[0200] Figure 48 shows a data structure of the audio 
title set 800 in the third example. The audio title set 800 
shown in Figure 48 has the same structure as that 
shown in Figure 5 except for the structure of the PGC 
information 833. 

[0201] Figure 49 shows a data structure of the PGC 
information 833. The PGC information 833 includes 
"ATS PGC general information (ATS_PGC_GI)" f an 
"ATS program information table (ATS_PGIT)", an "audio 
cell reproduction information table (ATS_C_PBIT)", and 
an "ATS audio still video reproduction information table 
(ATS_ASV_PBIT)". 

[0202] The "ATS PGC general information 

(ATS_PGC_GI)" indudes the number of audio pro- 
grams and the number of cells included in the PGC 
information; the reproduction time period of the PGC; 
and information the ATS_PGC information. The address 
information on the ATS_PGC information includes 
pointer information to an "ATS program information 
table (ATS_PGIT)", an "audio cell reproduction informa- 
tion table (ATS_C_PBIT)", and an "ATS audio still video 
reproduction information table (ATS_ASV_PBIT)". 
[0203] The "ATS program information table 

(ATS_PGIT)" induces a plurality "audio program infor- 
mation (ATS_PGI)" units included in the PGC. 
[0204] Each of the plurality of "audio program informa- 
tion (ATS_PGI)" units includes "audio program physical 
allocation information" indicating whether or not the 
AOB of each program and the AOB of the previous pro- 
gram are recorded at physically discontinuous positions 
on the optical disk, "audio program time attribute infor- 
mation" indicating whether or not the time information of 
the AOB is continuous with the time information of the 
previous AOB. an "audio program start cell number" 
indicating the first cell number forming the audio pro- 
gram, "reproduction start audio cell time" indicating the 
first time information of the first audio cell included in the 
audio program, an "audio program total reproduction 
time period" indicating the reproduction time period of 
the audio program, an "audio pause time period" indi- 
cating the silent period until the reproduction of the 
audio cell of the audio program starts, an "ASVU 
number (ASVUN)" designating the still picture unit to be 
reproduced simultaneously with audio data of the pro- 
gram, an "audio still video display mode (ASV_DMOD)" 
indicating the display mode of the still picture such as, 
for example, whether the still picture or still pictures are 
displayed in accordance with the order or randomly, and 
an "ATS_ASV_PBI start address" and "ATS_ASV_PBI 
termination address" designating corresponding ATS 
audio still video reproduction information in the ATS 
audio still video reproduction information table 
(ATS_ASV_PBIT). 

[0205] Figure 50 shows a data structure of the "audio 
still video display mode (ASV_DMOD)". The "audio still 
video display mode (ASV_DMOD)" includes a "display 



BNSDOCID: <EP 0942609A1_!_> 



41 



EP 0 942 609 A1 



42 



timing mode" flag and a "display order mode" flag. 
[0206] The "display timing mode" indicates specific 
timing for reproducing the audio data and an updating 
time period of the still picture. 

[0207] When the "display timing mode" is Browsable, s 
the display timing of the still picture can be changed by 
interaction from the user. When the "display timing 
mode" is SlideShow, the display timing of the still picture 
cannot be changed by interaction from the user. 
[0208] When the "display timing mode" is SlideShow, 
the display of the still picture is updated in accordance 
with the display timing determined synchronously with 
the audio information which is simultaneously repro- 
duced. When the "display timing mode" is Browsable, 
the display of the still picture is updated when the deter- 
mined display continuation time period has passed, 
asynchronously from the audio information which is 
simultaneously reproduced. 

[0209] The "display order mode" indicates whether the 
still picture or still pictures are displayed sequentially in 
accordance with the display list, randomly or shuffle. A 
random display of the still picture refers to that the still 
picture or still pictures are selected randomly while per- 
mitting overlap and are displayed in conformity with the 
selected order. A shuffle display of the still picture refers 
to that the still picture or still pictures are selected ran- 
domly while prohibiting overlap and are displayed in 
conformity with the selected order. 
[0210] The "audio cell reproduction information table 
(ATS_C_PBIT)" includes a plurality of "audio cell repro- 
duction information (ATS_CJPBI)" units. Each "audio 
cell reproduction information unit (ATS_C_PBI)" 
includes a "cell index number" indicating the order of 
cells included in the program, a "cell type" indicating the 
attribute of the cell (i.e.. whether the cell is a still picture 
cell or a silent cell), a "cell start address" indicating the 
start address of the cell in the form of a relative address 
with respect to the first pack of the AOB of the ATS 
including the cell, and a "cell termination address" indi- 
cating the final address of the cell in the same manner. 
[0211] The "ATS audio still video reproduction infor- 
mation table (ATS_ASV_PBIT)" includes a plurality of 
"ATS audio still video reproduction information 
(ATS_ASV_PBI)" units. Each "ATS audio still video 
reproduction information unit (ATS_ASV_PBI)" includes 45 
a plurality of "display lists (DLIST)" indicating which 
P_VOBs stored in the audio still video set are repro- 
duced and in which order the P_VOBs are reproduced. 
[0212] Each "display list (DLIST)" includes an "ASV 
number" indicating the ordinal number (i.e. , first, second so 
or the like) of the P_VOB in the ASVU, an "initial button 
number (FOSL_BTNN)" indicating the number of a but- 
ton which is in the highlight information and selected in 
an initial state, an "audio program number" indicating 
the audio program corresponding to the DLIST, "display ss 
timing" indicating the timing for displaying the P_VOB 
included in the DLIST, and a "start section transition 
mode" and a "termination section transition mode" indi- 



cating the transition of the still picture. 
[0213] In this example, as the picture transition 
modes, "no transition", "fading from black", "fading into 
black", "cross-fading", and "wiping in several directions" 
are assumed. 

(3.4) Data structure of the AOB 802 

[0214] The AOB 802 is packetized into 2 Kbytes. The 
AOB 802 stores data in the LPCM format, AC3 format, 
DTS format or other compression formats. In the case 
of the LPCM format, the sample bit is 16, 20, or 24 bits, 
with a sampling frequency of 48 kHz. 96 kHz, 192 kHz, 
44. 1 kHz, 88.2 kHz, or 1 76.2 kHz. 
[0215] The AOB 802 has an exemplary structure as 
described in the second example with reference to Fig- 
ure 24. 

(4) Two reproduction methods 

[021 6] The audio zone region 32a stores three MPEG 
streams of a VOB, an AOB and a P_VOB. The AOB is 
included in the audio title set (ATS). The P_VOB is 
included in the audio still video set (ASVS). The VOB is 
included in the audio manager (AMG) as shown in Fig- 
ure 44. 

[021 7] The AOB and the P_VOB are the main informa- 
tion stored on the disk, and are typically music informa- 
tion with a still picture. The VOB is typically a menu for 
allowing the user to select any music information with a 
still picture. 

[0218] Among the MPEG streams stored in the audio 
zone region 32c, the VOB is reproduced by a standard 
method specified by MPEG. The AOB and the P_VOB 
are reproduced by an expanded different method which 
is different from the standard method. 

(4.1) Standard reproduction method 

[021 9] First, the standard reproduction method will be 
described. This method is used for reproducing a VOB 
stored on the optical disk 

[0220] Figure 51 shows a data structure of a system 
stream in the third example conforming to MPEG. The 
optical disk stores at least one system stream. 
[0221] The system stream can include a plurality of 
elementary streams. The elementary streams include a 
video elementary stream storing moving picture infor- 
mation, an audio elementary stream storing audio infor- 
mation, and a sub-picture elementary stream storing 
sub-picture information. The plurality of elementary 
streams are each divided into a plurality of packs. 
[0222] The system stream shown in Figure 51 
includes a video elementary stream, an audio elemen- 
tary stream, and a sub-picture elementary stream. The 
video elementary stream is divided into one or more 
video packs. The audio elementary stream is divided 
into one or more audio packs. The sub-picture elemen- 
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tary stream is divided into one or more sub-picture 
packs. In Figure 51, the letter "V" represents a video 
pack, the letter "A* represents an audio pack, and the 
letter "S n represents a sub-picture pack 
[0223] As shown in Figure 51, each pack includes a 5 
"pack header", a "packet header" and a "data field". The 
data length of one pack is 2Kbyte. 
[0224] Described in the "pack header" is data con- 
forming to MPEG including, for example, a pack start 
code, an SCR (System Clock Reference), and an MUX 10 
(Multiplex) rate. 

[0225] Described in the "packet header" is data con- 
firming to MPEG including, for example, a stream ID, a 
packet length, an STD (System Target Decoder), a 
buffer scale size, and a PTS (Presentation Time is 
Stamp). 

[0226] The stream ID described in the "packet header" 
indicates whether data stored in the "data field" is a 
video elementary stream or any other elementary 
stream. Any other elementary stream is an audio ele- 20 
mentary stream, a sub-picture elementary stream or 
management information. 

[0227] The management information is referred to as 
an "NV pack". The management information includes a 
PCI packet and a DSI packet. The PCI packet stores 25 
highlight information for receiving and processing user 
interaction. The DSI packet stores information utilized 
for special effects such as, for example, fast-forward 
reproduction. 

[0228] In this example, the data field of the elementary 30 
streams other than the management information stores ~ 
a sub-stream ID. The sub-stream ID is an 8-bit field indi- 
cating whether the data stored in the data field is audio 
data or sub-picture data. The sub-stream ID further indi- 
cates an - encode type and a channel identification 35 
number. When the data stored in the data field is audio 
data, the encode type is one of the linear PCM format, 
AC-3 format, DTS format and the like, and the channel 
identification number is one of #0 through #7 (8 types 
max.). When the data stored in the data field is sub-pic- 40 
ture data, the channel identification number is one of #0 
through #31 (32 types max.). 

[0229] In the "data field", data compressed in accord- 
ance with a prescribed format is recorded. For example, 
video data is compressed by MPEG2 and recorded in 45 
the data field. Audio data is compressed by the linear 
PCM, AC-3 or DTS format and recorded in the data 
field. Sub-picture data is compressed by the run-length 
coding and recorded in the data field. 
[0230] The PTS in the "pack header" and the "packet so 
header" is used to synchronize the decoding of the 
audio pack and the decoding the sub-picture pack 
[0231] In the case where the reproduction apparatus 
reproduces a system stream recorded on the optical 
disk, the reproduction apparatus sets the SCR included 55 
in the leading pack of the system stream as an initial 
value of the STC, which is a reference clock, and inputs 
each pack to the corresponding decoder at time timing 



indicated by the SCR while referring to the STC. Each of 
the decoded packs is externally output from the decoder 
at the timing indicated by the PTS. By outputting each 
pack at the timing indicated by the PTS included in the 
pack, the synchronisation of the audio, video and sub- 
picture outputs is guaranteed. 

[0232] As described above, in the case of using the 
standard reproduction method, the system stream to be 
reproduced is multiplexed with audio information, video 
information and sub-picture information. The system 
streams are reproduced by resetting the STC, which is 
a reference clock, of each system stream and externally 
outputting the audio, video and sub-picture data from 
the decoder at the timing indicated by the PTS included 
in each pack. Thus, the audio data, video data and sub- 
picture data are reproduced synchronously with one 
another. 

(4.2) Reproduction method of AOB and P_VOB 

[0233] Hereinafter, a reproduction method of an AOB 
and a P_VOB recorded on the optical disk will be 
described. 

[0234] Figure 52 shows the relationship among the 
P_VOB, ASVU, Program and AOB. 
[0235] An ATS can include a plurality of PGC informa- 
tion (ATS_PGCI) units. In the example shown in Figure 
52, the ATS includes one PGC information unit 
(ATS_PGCI). 

[0236] Described in the PGC information is one or 
more audio program information (ATS_PGI) units (e.g., 
Program #1, Program #2. ... Program #L). The audio 
program information is an audio data access unit. 
[0237] The audio program information includes one or 
more audio ceil reproduction information (ATS_C_PBI) 
units. The audio cell reproduction information unit is a 
minimum management unit. The audio cell reproduction 
information points to the position in the audio data 
stored in the AOB. Thus, the audio data to be repro- 
duced in relation to the audio cell reproduction informa- 
tion is determined. 

[0238] Also described in the PGC information is one 
or more ATS audio still video reproduction information 
(ATS_ASV_PBI) units. The ATS audio still video repro- 
duction information defines the mode in which the still 
picture is reproduced on a program-by-program basis. A 
plurality of audio program information units can share 
one ATS audio still video reproduction information unit. 
In the example shown in Figure 52, Program #1 and 
Program #2 share ATS_ASV_PBI#1 . 
[0239] The audio still video reproduction information 
can be shared when a plurality of audio program infor- 
mation units point to a common entry in the ATS audio 
still video reproduction information table 
(ATS_ASV_PBIT). 

[0240] Figure 69 shows the relationship between the 
DLIST and P_VOB. 

[0241 ] The ATS_ASV_PBI includes a plurality of dis- 
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play lists (DLIST),and each DLIST points to the still pic- 
ture data (P_VOB) to be displayed. Usually, the still 
picture data (P_VOB) is reproduced in accordance with 
the order of the DLIST. The ATS_ASV_PBI can be des- 
ignated by the unit of the program of the ATS_PGCI. 5 
The same ATS_ASV_PBI can be designated by contin- 
uous programs. The P_VOB can be designated by the 
ASVU, which is a reading unit. The same ASVU is des- 
ignated by the programs designating the same 
ATS_ASV_PBI. 

[0242] In other words, continuous programs can be 
designated so as to use the same ATS_ASV_PBI with 
respect to the same ASVU. This is referred to as the 
ASVU range. 

[0243} During the reproduction of the programs in the 
same ASVU range, the same ASVU is used. Accord- 
ingly, the still picture is not read and thus the reproduc- 
tion of audio data is not interrupted. The DLIST of the 
ATS_ASV_PBI can designate an arbitrary P_VOB 
which is included in the same ASVU. Accordingly, the 
same P_VOB can be designated a plurality of times. 
The P_VOB in the ASVU need not be designated in the 
DLIST. 

[0244] Accordingly, different still pictures can be repro- 
duced while using the same ASVU among different pro- 
grams, although the ATS_ASV_PBI needs to be 
changed. 

[0245] The audio program information designates the 
ASVU recorded in the ASVS in order to designate a plu- 
rality of still pictures (P_VOB) to be displayed simultane- 
ously with the reproduction of the AOB. 
[0246] The ASVU defines the unit by which the repro- 
duction apparatus buffers the still picture data. In other 
words, the still picture data in the designated ASVU is 
read before one audio program information unit is repro- 
duced. Accordingly, the reproduction apparatus need 
not read the still picture data from the optical disk while 
reproducing a music program. 

[0247] When reproducing the next music program 
information, the reproduction apparatus determines 
whether the plurality of pieces of still picture data indi- 
cated by the required ASVU exists iri the main memory 
area or not. In the case where the plurality of pieces of 
still picture data of ASVU used in the previous audio 
program information exist in the main memory area, the 
reproduction apparatus need not read the still picture 
data from the optical disk. 

[0248] The disk title creators can allow the reproduc- 
tion apparatus to switch from one audio program infor- 
mation unit to the next audio program information unit 
rapidly by causing one ASVU to be shared by the two 
audio program information units. The reproduction 
apparatus can perform rapid switching from one audio 
program unit into the next audio program unit when the 
successive audio program units share the ASVU. 
[0249] Each ATS_ASV_PBI manages the display 
order of the still pictures to be displayed by the display 
list (DLIST). In the display list (DLIST), the still picture to 



be displayed is designated by the ASV number. The 
ASV number is used for specifying the still picture 
included in the ASVU. 

[0250] Thus, the PGC information allows the still pic- 
ture to be designated for each audio program informa- 
tion unit. Alternatively, the same still picture can be 
displayed for a plurality of audio program information 
units. Even when the audio program information is 
changed, the display can be changed without muting 
the audio data. 

[0251 ] Figure 53 shows an exemplary PGC forming a 
title. In this example, the PGC includes four audio pro- 
grams. Audio program #1 corresponds to AOB #1 , audio 
program #2 corresponds to AOB #2, audio programs #3 
and #4 correspond to AOB #3. On the recording 
medium, the AOBs are recorded in the order of AOB#1 , 
AOB#2 and AOB#3. Audio programs #1, #2 and #3 
each has a silent cell, and audio programs #2 has two 
music cells. 

[0252] Audio programs #1 and #2 represent ASVU#1 , 
and audio programs #3 and #4 represent ASVU#2. 
P_VOB #1 is displayed in the cell #2, P_VOB #2 is dis- 
played in the cell #4, P_VOB #3 is displayed in the cell 
#5. P__VOB #4 is displayed in the cell #7, and P_VOB #5 
is displayed in the cell #8. 

[0253] In the case where all the audio cells each have 
a reproduction time period of 60 seconds (5.400.O00 in 
PTS) and all the still picture cells each have a reproduc- 
tion time period of 1 second (90,000 in PTS), audio pro- 
gram information can be described as shown in Figure 
54. Still picture program information can be described 
as shown in Figure 55. 

[0254] Under the conditions that the audio data is 48 
kHz, 16 bit sampling with 2 channels, the number of the 
silent packs is 96 and the number of the audio cell packs 
is 5760 as shown in Figure 56. Where the number of still 
picture packs is 50, the ASV search pointer is as shown 
in Figure 57. The ASVU general information is as shown 
in Figure 58. 

[0255] In this example, the address information on the 
ASV search pointer is a relative address with respect to 
the recording position of the first P_VOB, but can be in 
any other form which specifies the position of the 
P_VOB on the optical disk. In the case where, for exam- 
ple, the ASVU generation information includes address 
information acting as the reference, the address infor- 
mation of the ASV search pointer can be the relative 
address with respect to the position indicated by the 
address information in the ASVU general information. 
[0256] When the audio still video display mode is 
"SlideShow" as in the case of programs #1 and #2 
shown in Figure 54, the still picture reproduction is 
reproduced in synchronization with the audio data. 
Since the ASVU number of the programs #1 and #2 is 
"f, ASVU #1 is read to the buffer before the reproduc- 
tion of the programs. Then, as shown in Figure 55, the 
still picture information #1 is referred to from the 
address information of the ATS_ASV_PBI to start the 
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display of still pictures in display lists #1, #2 and #3 in 
the ATS audio still video reproduction information #1 at 
the time indicated by the "audio program number" and 
"display timing" of the display lists. The display timing is 
synchronous with the reproduction time period of the 5 
audio information to be reproduced with the still pic- 
tures. 

[0257] More specifically, the still picture in the display 
list #1 of the ATS audio still video reproduction informa- 
tion #1 is displayed when the PTS becomes 90,000 10 
while the AOB #1 is being reproduced. The still picture 
in the display list #2 of the ATS audio still video repro- 
duction information #1 is displayed when the PTS 
becomes 90,000 while the AOB #2 is being reproduced. 
The still picture in the display list #3 of the ATS audio still 15 
video reproduction information #1 is displayed when the 
PTS becomes 5,580,000 while the AOB #2 is being 
reproduced. 

[0258] As described above, when the audio still video 
display mode is "SlideShow". the still picture is dis- 20 
played at prescribed timing synchronously with the 
reproduction of the aucfio data. The display timing can- 
not be changed by the interaction from the user (e.g., 
selection of a specific button by the mouse). 
[0259] Figure 59A shows the relationship between the 25 
reproduction timing of the audio data and the display 
timing of the still picture when the audio still video dis- 
play mode is "SlideShow". 

[0260] By contrast, when the audio still video display 
mode is "Browsable" as in the case of programs #3 and 30 
#4 shown in Figure 54, the still picture is reproduced 
asynchronously from the audio data. Since the ASVU 
number of the programs #3 and #4 is "2", the ASVU #2 
is read to the buffer before the reproduction of the pro- 
grams. Then, still picture information #2 is referred to 35 
from the address information of the ATS_ASV_PBI. The 
display timing of the still pictures in display lists #1 and 
#2 in the ATS audio still video reproduction information 
#2 can be changed by the interaction from the user 
(e.g., input of a command by the user). 40 
[0261] In the case where no interaction from the user 
is generated within the period indicated by the "display 
timing" of the display list, the time period indicated by 
the "display timing" is considered as the display contin- 
uation time period. When the display continuation time 45 
period has passed, the display of the next still picture is 
started. 

[0262] It is possible to designate with the audio stiii 
video display mode whether the still pictures are 
sequentially displayed in accordance with the display so 
list, the still pictures randomly selected while permitting 
overlap are reproduced (random reproduction), or the 
still pictures randomly selected while prohibiting overlap 
are reproduced (shuffle reproduction). 
[0263] Practically, when the reproduction starts with ss 
program #3, ASVU #2 is loaded to the buffer since the 
ASVU number of program #3 is "2". Then, the DLIST 
indicated by the ATS audio still video reproduction infor- 



mation #2 is referred to from the address information of 
the ATS_ASV_PBl. Since the audio program number of 
display list #1 is "3". P_VOB #4 corresponding to the 
ASV number of the display list #1 is displayed. When a 
key corresponding to the Next_DLIST provided on the 
remote controller is pressed, the still picture of DLIST 
#2. i.e.. P_VOB #5 indicated by the ASV number of 
DLIST #2 is displayed. 

[0264] The still pictures in the DLISTs are displayed in 
accordance with the "display effect" (i.e., the start sec- 
tion display effect and termination section display effect, 
see Figure 55). When no user operation is performed 
within the time of 5,490,000 indicated by the display tim- 
ing of display list #1. i.e., about 1 minute (matching to 
the reproduction time period of program #3), the still pic- 
ture of the next DUST, i.e., P_VOB #5 is displayed in 
accordance with the "display effect" and the "transition 
time". The audio data of program #4 is reproduced in 
relation with the display of P_VOB #5. 
[0265] In the case where the reproduction time period 
of program #3 is shorter than the time period indicated 
by the display timing of P_VOB #4, the display of 
P_VOB #4 is terminated and the display of P_VOB #5 is 
started while program #3 is being reproduced. In the 
case where the display time period indicated by the dis- 
play timing of P_VOB #4 is longer than the reproduction 
time period of program #3. the display of P_VOB #5 is 
started after the reproduction of program #4 is started. 
[0266] When the reproduction is started with program 
#4, ASVU #2 is read as in the case of starting with pro- 
gram #3 to display the PJVOB in the first DLIST havmg 
the audio program number of "4". Thereafter, the display 
of the still pictures can be changed in accordance with 
the user operations or command. The audio program 
number of the DLIST is effective only for determining 
the P_VOB to be displayed first only when the ASVU is 
changed. When the reproduction is continued, a user 
operation or command has priority over the audio pro- 
gram number. When the P_VOB is displayed while the 
time period indicated by the display timing, the P_VOB 
indicated by the next DLIST is displayed. 
[0267] When the value indicated by the display timing 
refers to "infinity", the corresponding P_VOB is continu- 
ously displayed until the user operation or execution of 
a command is performed. As the value indicating the 
infinity, the maximum value (e.g., all the bit values are 
"1 ") can be used but other values are also usable. When 
the audio still video display mode is "Browsable", the 
display time period of the P_VOB can always be infinite. 
[0268] Figure 59B shows the relationship between the 
reproduction timing of the audio data and the display 
timing of the still picture when the audio still video dis- 
play mode is "Browsable". 

[0269] In this example, information required for each 
audio programs is recorded in the audio program infor- 
mation (ATS_PGI). When the information is found to 
correspond to each audio program, the information can 
be recorded in any area. For example, as shown in Fig- 
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ure 60, the ASVU number, audio still video display 
mode, ATS_ASV_PBI start address, and 
ATS_ASV_PBI termination address are recorded in the 
audio still video reproduction information search pointer 
<ATSJ>G_ASV_PBI_SRP). 5 
[0270] As described above, the AOB and P_VOB are 
reproduced simultaneously by decoders unlike in the 
case of the VOB. 

[0271] Hereinafter, the reproduction method of the 
VOB and the reproduction method of the AOB and io 
P_VOB will be compared. 

[0272] The two reproduction methods are identical 
with each other in that video information is output along 
with the audio information. It should be noted that in the 
case of the P_VOB, the video information is still picture is 
information including an I picture. 
[0273] One difference between the two reproduction 
methods is the recording position of the data. In the 
case of the VOB, audio information and video informa- 
tion are multiplexed together and recorded on the opti- 20 
cat disk as one system stream, whereas the AOB and 
the P_VOB are recorded in different areas of the optical 
disk as different system streams. 
[0274] Another difference between the two reproduc- 
tion methods is the synchronization method of audio 2s 
data and still picture data. VOBs are reproduced by a 
decoder one by one as described above. In more detail, 
the STC. which is a reference clock, is reset by the SCR 
in the "pack header*, and the STC is referred to output 
both audio information and still picture information at the 30 
timing of the PTS in the "packet header" in each pack. 
By contrast, AOBs and P_VOBs are reproduced by 
decoders simultaneously. As described in more detail 
below, a plurality of prescribed P_VOBs to be repro- 
duced synchronously with the AOB are read from the 35 
disk and input to the decoder before the AOBs. Unlike 
the VOBs, the P_VOBs input to the decoder are not dis- 
played immediately but are held in the decoder. Regard- 
ing each P_VOB, it is considered that the STC is reset 
with the SCR of the leading pack in accordance with the 40 
MPEG decoding model and the P_VOB is decoded in 
accordance with the PTS in the packet, so that data 
after the plurality of prescribed P_VOBs to be repro- 
duced synchronously with the AOB is accumulated. 
Actually, the data regarding the P_VOB before being 45 
decoded is accumulated in an appropriate buffer. When 
the P_VOBs are completely input into the decoder, the 
AOB is read from the optical disk and input to the 
decoder. When the AOB is input to the decoder, the 
STC, which is a reference clock, is reset by the SCR in so 
the leading pack, and the AOB is output at the timing of 
the PTS in each pack header referring to the STC as in 
the case of the VOB. 

[0275] Information on the actual output timing of the 
P_VOBs is recorded in another area of the optical disk ss 
(i.e., DUST as a part of the PGC information). The infor- 
mation on the output timing is separately supplied to the 
decoder. Each P_VOB in the decoder is not output syn- 



chronously with the STC but at the display timing 
recorded in the DLIST. In other words, in reproducing 
the AOB, the PTS which is included in the AOB is used 
to determine the display timing of the AOB; and in repro- 
ducing the P_VOB, the information which is separately 
recorded from the P_VOB is used to determine the dis- 
play timing of the P_VOB. 

[0276] The audio zone region and the DVD as a mul- 
timedia optical disk have been described. Next, a repro- 
duction apparatus in this example for reproducing a 
media having the above-described structure will be 
described. 

[0277] The basic structure of the reproduction appa- 
ratus in this example is the same as those described in 
the first and second examples. As in the second exam- 
ple, a key for separately controlling audio data and 
video data is provided in a remote controller for operat- 
ing the DVD player or is provided on a front panel of the 
DVD player. With such a key, the video data can be 
skipped forward and backward or rewound to the start of 
the video data while continuously reproducing the audio 
data. 

[0278] Figure 61 is a block diagram illustrating an 
internal structure of a DVD player 90 in the third exam- 
ple. The DVD player 90 is different from the DVD player 
70 shown in Figure 8 in the structure of the AV decoder 
section, which will be described below. 
[0279] In the third example, usable streams include an 
AOB as audio data, and a P_VOB as video data, and a 
VOB for menu or the like obtained by multiplexing audio 
and video data. An AV decoder section 85c includes a 
P_VOB system decoder 103, an AOB system decoder 
104, and a VOB system decoder 108. The AV decoder 
section 85c has a structure to operate the system 
decoders 103, 104 and 108 synchronously with and 
asynchronously from one another. 
[0280] The P_VOB system decoder 103 receives a 
P_VOB stream and distinguishes a stream ID and feub- 
stream ID in a header of each packet included in the 
stream, thereby classifying the data into a video packet, 
a PCI (highlight) packet, and a sub-picture packet. 
[0281] The AOB system decoder 104 receives sin 
AOB stream and classifies the data on a packet header- 
by-packet header basis. 

[0282] The VOB system decoder 1 08 receives a VOB 
stream and distinguishes a stream ID and sub-stream 
ID in a header of each packet included in the stream, 
thereby classifying the data into an audio packet, a 
video packet, a PCI (highlight) packet, and a sub-picture 
packet: 

[0283] The AV decoder section 85c still includes a 
highlight buffer 94 for temporarily storing highlight infor- 
mation of a PCI packet from the P_VOB system 
decoder 103 and the VOB system decoder 108, a video 
buffer 96 for temporarily storing a video packet from the 
P_VOB system decoder 103 and the VOB system 
decoder 108, a sub-picture buffer 105 for temporarily 
storing a sub-picture packet from the P_VOB system 
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decoder 103 and the VOB system decoder 108, ah 
audio buffer 99 for temporarily storing an audio packet 
from the AOB system decoder 104 and the VOB system 
decoder 108, a highlight decoder 95 for decoding the 
highlight data from the highlight buffer 94 and outputting 5 
the decoded highlight data to the system control section 
93, a video decoder 87 for decoding the video data from 
the video buffer 96, a sub-picture decoder 98 for decod- 
ing the sub-picture data from the sub-picture buffer 1 05, 
an audio decoder 100 for decoding the audio data from 10 
the audio buffer 99 and outputting the decoded audio 
data as an audio output, a video synthesizer 101 for 
synthesizing the decoding results of the video decoder 
87 and the sub-picture decoder 98 into one piece of 
video data, and a synchronization adjusting section 1 02 75 
for managing the synchronization of the highlight 
decoder 95, the video decoder 87, the sub-picture 
decoder 98 and the audio decoder 100. 
[0284] The reproduction apparatus having the above- 
described structure operates in, for example, the follow- 20 
ing manner. The operation up to the reading of the 
stream is the same as described in the first and second 
examples and will not be described below. 
[0285] Figures 62 through 66 are schematic flow- 
charts illustrating an operation for reproducing the PGC 25 
information. 

[0286] For starting the reproduction, the system con- 
trol section 93 determines whether or not the DVD 
player 90 is in a video-oriented reproduction mode, by a 
reproduction mode determination section. When it is 30 
determined that the DVD player 90 is In the video-ori- 
ented reproduction mode, the system control section 93 
reads the video manager based on the information 
which is read from the volume file management region. 
The system control section 93 refers to the PGC man- 35 
agement information table for the video manager menu 
to calculate the recording address of the PGC for the 
volume menu. The resultant PGC is reproduced and 
retained inside. When the PGC for the volume menu is 
retained inside, the system control section 93 refers to 40 
the retained PGC information to calculate the video 
object (VOB) to be reproduced and the recording 
address of the VOB on the optical disk. When the VOB 
to be reproduced is determined, the system control sec- 
tion 93 outputs a control signal to the mechanism con- 45 
trol section 83 and the signal processing section 84 so 
as to retrieve the determined VOB from the optical disk 
for reproduction. When the system stream forming the 
VOB is input to the AV decoder section 85c, an instruc- 
tion indicating "reproduce VOB M as a decode stream so 
instruction is given to the AV decoder section 85c. 
[0287] The VOB input to the AV decoder section 85c 
is divided into streams forming the VOB. i.e., an audio 
pack, video pack, sub-picture pack, and NV pack by the 
VOB system decoder 1 08, and the streams are respec- ss 
tively input to the audio buffer 99, the video buffer 96, 
the sub-picture buffer 105, and the highlight buffer 94. 
The highlight buffer 95 buffers only the highlight infor- 



mation in the PCI packet in the NV pack. Data input to 
each buffer is input to the audio decoder 100, the video 
decoder 87, the sub-picture decoder 98 or the highlight 
decoder 95 to be decoded Audio data for menu is output 
from the audio decoder 1 00. The outputs from the video 
decoder 87 and the sub-picture decoder 98 are synthe- 
sized by the video synthesizer 101 and output as video 
data. The output from the highlight decoder 95 is read to 
the system control section 93 and the button information 
or the like of each menu is stored in the memory in the 
system control section 93. 

[0288] Thus, a video menu from which the user can 
select the title to be reproduced (i.e.., audio program) is 
displayed (see Figure 40). 

[0289] The user sees the menu and selects and con- 
firms the title that he/she is interested in by designating 
the item number in the menu using the remote control- 
ler. Then, the system control section 93 receives the 
designation of the item number in the menu from the 
remote controller and refers to the button information of 
the highlight information of the PCI pack included in the 
VOB of the video menu which is being reproduced, the 
VOB being stored in the memory, to execute a control 
command corresponding to the designated number. 
The control command is PlayProgram #n or the like, and 
the title number to be reproduced is designated by "n". 
[0290] As an execution operation by the PlayProgram 
command, the system control section 93 refers to the 
title search pointer table, which is a part of the audio 
manager, to determine the audio title set (ATS) to which 
the designated title belongs to and the title number m 
the ATS. When the ATS is confirmed, the system control 
section 93 outputs a control signal to the mechanism 
control section 83 and the signal processing section 84 
so as to reproduce the ATS management information of 
the confirmed title set and sequentially retrieves ATS 
PGC information search pointer, which is a part of the 
ATS management information, to determine the PGC 
information including the audio program to be repro- 
duced. 

[0291 ] When the PGC information is determined, the 
system control section 93 outputs a control signal to the 
mechanism control section 83 and the signal process- 
ing section 84 so as to reproduce the determined PGC 
information from the disk and retain the information in 
the inside buffer memory for the PGC information. 
[0292] When the PGC information is retained, the sys- 
tem control section 93 reproduces the PGC information 
in accordance with the flowchart shown in Figure 62. It 
is assumed that the audio program to be produced is 
already determined by the above-described menu com- 
mand or direction designation using the remote control- 
ler. The system control section 93 refers to the retained 
PGC information to first acquire the ASVU number of 
the first audio program information and compares the 
number with the ASVU number previously reproduced. 
When the ASVU number is changed, the system control 
section 93 reads all the still picture data, i.e., P_VOBs 
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included in the ASVU based on the ASVU general infor- 
mation corresponding to the ASVU number and inputs 
the number into the AV decoder section 85c. Simultane- 
ously, an instruction indicating "reproduce P_VOB" is 
given to the AV decoder section 85c as a decode 5 
stream instruction from the system control section 93. 
The AV decoder section 85c separates the input 
P_VOBs on a pack-by-pack basis or packet-by-packet 
basis by the P_VOB system decoder 92 and stores the 
P_VOBs in corresponding buffer memories. 10 
[0293] At this point, the size of the video buffer 96 
needs to be larger than the sum of the sizes of the 
P_VOBs of the ASVU. For any reproduction apparatus, 
a maximum value of the total data amount of the 
P_VOBs included in the ASVU is set in order to guaran- is 
tee the reproduction. The video buffer 96 in any repro- 
duction apparatus needs to have a size accommodating 
the maximum value. In this example, the maximum 
value of 2 MByte. Since an I picture is 100 to 200 Kbyte, 
10 to 20 I pictures can be reproduced along with the 20 
audio information. 

[0294] When all the P_VOBs forming the ASVU are 
completely input to the decoder, the highlight decoder 
95, the video decoder 87, and the sub-picture decoder 
98 do not perform a decoding operation. In order to 25 
store all the data regarding the corresponding ASVU 
into the buffers before the audio data is reproduced, the 
highlight buffer 94, the video buffer 96, and the sub-pic- 
ture buffer 105 need to have a greater buffer capacity 
than the buffer capacity specified by MPEG2 or DVD- 30 
Video, 

[0295] When all the P__VOBs are completely stored in 
the corresponding buffers, the AV decoder section 85c 
and the audio decoder 100 are set to be in a decodaWe 
state by the audio encoding mode in the ATS PGC infor- 35 
mation search pointer 832 corresponding to the desig- 
nated PGC information for preparation of the 
reproduction of the audio data of the designated PGC. 
Then, the system control section 93 prepares for the 
display of the still picture. The system control section 93 *o 
determines the ATS audio still video reproduction infor- 
mation from the ATS_ASV_PBi start address of the 
audio program information. 

[0296] When the audio still video display mode of the 
audio program information is Browsable, the first DUST 45 
of the ATS audio still video reproduction information 
determines the first still picture to be reproduced. When 
the audio still video display mode is SlideShow, the 
DLIST having the earliest display timing among a plural- 
ity of DLISTs of the ATS audio still video reproduction so 
information in the range of audio programs to be repro- 
duced is selected. When the DLIST is determined, the 
system control section 93 displays the still picture, i.e., 
the P__VOB designated by the ASV number of the 
DLIST. Specifically, the system control section 93 out- ss 
puts display control information of the still picture infor- 
mation to the synchronization adjusting section 102 in 
the AV decoder section 85c. The display control infor- 



mation includes a wait period until the I picture of the 
P_VOB to be updated is designated and the update 
thereof is displayed. When the display control section is 
input, the synchronization adjusting section 102 
instructs the video decoder 87 to output the designated 
I picture at the designated timing in accordance with the 
display control information. Thus, the P_VOB is dis- 
played. Unlike the AOB and VOB t the P_VOB is output 
at the display timing recorded in a separate area. 
[0297] When the display of the P_VOB is started, the 
audio data designated by the audio program is repro- 
duced simultaneously. Simultaneously, the key opera- 
tion by the user is accepted. 

[0298] The reproduction of the audio data designated 
by the audio program is performed as shown in Figure 
63. First, the cell number (Cn) to be reproduced is deter- 
mined by the ATS program information table and the 
ATS cell reproduction information table. Here, since this 
is the start of the operation, the cell number is 1 . When 
the cell type of the cell to be reproduced is a silent cell, 
processing for reproducing a silent cell is performed. 
This requires only the effective silent period cone- 
sponding to the reproduction time period of the silent 
cell. When the cell is not a silent cell, the cell is an audio 
cell. Accordingly, the reproduction of the audio cell 
shown in Figure 65 is performed. In order to output the 
audio data from the leading end of the audio cell, the 
AOB to be reproduced and the recording address 
thereof are determined based on the audio cell repro- 
duction information, and a control signal is output to the 
mechanical control section 83 and to a signal process- 
ing section 84. Thus, the reproduction of the determined 
AOB is prepared. The AOB read from the optical disk is 
input to the AV decoder section 85c. Simultaneously, 
the system control section 93 gives an instruction indi- 
cating "reproduce AOB" as a decode stream instruction 
to the AV decoder section 85c. The AOB is decoded 
into an audio stream by the AOB system decoder 104 
and input to the audio buffer 99 and then to the audio 
decoder 100. Thus, the AOB is prepared to be output as 
audio data. Once the AOB is prepared, the STC, which 
is a reference clock, of the synchronization adjusting 
section 102 is reset by the SCR of the leading pack of 
the AOB, and the audio data in the AOB is output at the 
timing of the PTS stored in the packet header. When the 
output of the AOB is started, highlight information and 
sub-picture information for the leading I picture of the 
P_VOB and the menu items displayed in the state of 
overlapping with the leading I picture are decoded and 
output. Although it is not guaranteed that all the packets 
of the AOB include a PTS, the audio decoder outputs 
the audio data while complementing the PTS. 
[0299] When the audio still video display mode is 
Browsable while the audio data is being output and the 
display timing of DLIST corresponding to the P_VOB 
which is being displayed reaches the PTS of the audio 
data which is being reproduced, K is determined that the 
display time period of the still picture has passed and 
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thus the still picture of the next DLIST is reproduced. 
When the audio still video display mode of the audio 
program is SlideShow and the audio program number of 
the next DLIST is not the same as the audio program 
number of the DLIST which is being reproduced, it is 5 
determined that there is no DLIST to be reproduced and 
thus the still picture displayed is continuously displayed. 
When the next DLIST is the same as the audio program 
number of the DLIST which is being reproduced, the still 
picture of the next DLIST is reproduced when the dis- 10 
play timing of the next DLIST reaches the PTS of the 
audio data which is being reproduced. 
[0300] Figure 66 is a f lowchart illustrating the repro- 
duction of the second and later still pictures. First, the 
termination transition mode of the DLIST of the still pic- is 
ture which is being displayed is acquired. When the ter- 
mination transition mode is "no transition", the display is 
terminated. When the termination transition mode is not 
"no transition", i.e., when the transition effect is desig- 
nated, the still picture is erased by the designated tran- 20 
sition. The designated transition is, for example, fading, 
by which the color becomes gradually lighter. It should 
be noted that the transition effect may be ignored or 
replaced with another effect unless the reproduction 
apparatus has a capability of performing the designated 25 
effect. 

[0301] When the still picture displayed is erased, the 
start section transition mode of the DLIST of the next 
still picture is acquired. When the transition mode is "no 
transition", the next still picture is immediately dis- 30 
played. When the transition mode is not "no transition", 
i.e., when the transition effect is designated, the next 
still picture is displayed with an effect. Thus, the still pic- 
tures are switched. At this point, the highlight informa- 
tion and the sub-picture information already read in the 35 
buffer are decoded as well as the video information 
included in the P_VOB. 

[0302] The processing in response to the key opera- 
tions by the user is performed as shown in Figure 64. 
When the key operation accompanies a change of the 40 
audio program, i.e.. when the key operation is for Next- 
Program (reproduction of the next audio program), 
P rev Program (reproduction of the previous audio pro- 
gram), reproduction of the designated audio program, 
reproduction of the audio program by a command after 4s 
the menu is retrieved, or a change of the audio cell, the 
audio cell of the audio program to be reproduced is 
changed and processing for reproducing the audio cell 
is performed (Figure 65). 

[0303] When the key operation by the user is for a so 
change of only the still picture, i.e., the key operation is 
for NextDLIST (display of the next DLIST), PrevDLIST 
(display of the previous still picture), or display of the 
designated DLIST. the designated stifl picture is 
selected and processing for reproducing the still picture ss 
is performed only when the audio still video display 
mode of the designated still picture is Browsable (Figure 
66). 



[0304] In response to other key operations, the desig- 
nated operation is performed without changing the 
audio output or still picture output Such operations 
include display of the reproduction state and switching 
of the time display mode. In order to reproduce another 
stream (VOB) such as retrieval of the menu, the audio 
output and still picture display are terminated and the 
apparatus is switched to decode the VOB. 
[0305] Thus, all the titles are reproduced and the oper- 
ation is stopped. In accordance with specific setting of 
the DVD player or the DVD, only one title can be repro- 
duced before the operation is stopped, or a menu can 
be displayed after reproduction of one or more titles is 
completed. 

[0306] Hereinafter, the control of the video decoder 
and video buffer performed by the synchronization 
adjusting section 1 02 will be additionally described. 
[0307] Since data of a still picture is already stored in 
the buffer memory, the display of the still picture is man- 
aged based on the position in the buffer memory at 
which the still picture of the designated cell of the desig- 
nated program is recorded or based on the ordinal 
number (i.e., first, second, etc.) of the still picture with 
respect to the start. When the management is per- 
formed based on the position in the buffer memory, the 
data is sent to the video decoder 87 from the designated 
address in the video buffer 96 and displayed by an 
instruction of the synchronization adjusting section 102. 
When the management is performed based on the ordi- 
nal number of the still picture, the data stored in the 
video buffer 96 from the leading end ffiereof is sent to ~ 
the video decoder 87 and the display is switched when 
the designated ordinal number is reached. When the 
still picture is switched, display effects can be added by 
designating the transition mode. The time to switch the 
still picture can be designated on a program-by-program 
basis before and after the display of the still picture. 
However, in the case where the reproduction apparatus 
does not have a special effect display function, the still 
picture can simply be switched with the display effects 
being ignored. 

[0308] In a synchronous mode (SlideShow), the audio 
data is decoded by the audio decoder 100, and the still 
picture, highlight and sub-picture are reproduced in 
accordance with the "display timing". The synchroniza- 
tion adjusting section 102 compares the reference time 
of the system and the "display timing" from the system 
control section 93 and control the synchronization. 
[0309] In this case, switching of only the display by the 
user is prohibited. SlideShow is effectively used, for 
example, for displaying lyrics along with the music and 
displaying a still picture in a moving picture. In the asyn- 
chronous mode (Browsable), audio data is not synchro- 
nized with and reproduced separately from highlight 
data, sub-picture data and still picture data. The high- 
light data, sub-picture data and still picture data are syn- 
chronized to one another, and the audio data is 
continuously reproduced in realtime based on the time 
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information of the system. The still picture is switched in 
the asynchronous mode by the user operation and the 
command. The user operation basically are, for exam- 
ple, an operation of continuously reproducing audio pro- 
grams while feeding display list (DLIST) forward, 5 
backward or to the leading end; and an operation of 
feeding both audio programs and display list (DLIST) 
simultaneously forward, backward or to the leading end. 
The command has similar functions. In this case, the 
"display timing" is considered as the display continua- 70 
tion time period of the still picture When the interaction 
from the user is not generated during the time period of 
the "display timing", a still picture is displayed based on 
the next DLIST. 

[0310] Browsable is effectively used, for example, for is 
displaying photos in a switching manner while music is 
reproduced and displaying various types of information 
including an explanation about the music and informa- 
tion on the people involved in a switching manner. 
[031 1] The synchronous mode and the asynchronous 20 
mode are switched over in accordance with the picture 
program reproduction control. The audio still video dis- 
play mode of the audio program information can be des- 
ignated so that some of the audio programs included in 
the audio title are randomly selected and repeated a 25 
designated number of times or such audio programs are 
repeated a designated number of times prohibiting rep- 
etition of the same program (shuffle mode). 
[0312] The AV decoder section 85c can have other 
structures. 30 
[0313] As shown in Figure 67, the system decoders 
for AOB, P_VOB and VOB can be replaced with a single 
system decoder 109 having the functions of these 
decoders. Since the AOB is input at different timing from 
the P_VOB and VOB, a system decoder which is sub- 35 
stantially the same as that specified by DVD-Video is 
usable except that the buffer memory capacities of the 
highlight buffer 94, the video buffer 96 and the sub-pic- 
ture buffer 105 are increased and that a management 
mechanism for a still picture is added to control of the 40 
video buffer 96. An AV decoder section 85d shown in 
Figure 67 operates in substantially the same manner as 
the AV decoder section 85c shown in Figure 61 . 
[0314] As shown in Figure 68, the P_VOB buffer 106 
for P_VOB can be provided immediately before the AV 45 
decoder section 85d. The same type of AV decoder 
section as used by DVD-Video is usable. All the P_VOB 
data to be read before the reproduction of the audio 
data is stored in the P_VOB buffer 106. The P_VOB 
buffer 106 dynamically multiplexes the P_VOBs corre- so 
sponding to the reproduction order with the audio 
stream and sends the resultant data to the AV decoder 
section 85d. Accordingly, the system decoder 109 for 
AOB and P_VOB in the AV decoder section 85d is 
required to have a processing speed which is slightly ss 
higher than the speed specified by the DVD-Video 
standards. Streams need to be supplied to the AV 
decoder section 85d so that the audio buffer 99 does 



not underflow. 

[031 5] In the third example, as described above, pro- 
gram reproduction information including the start time 
and reproduction time period of each piece of audio 
data based on the reproduction start time of the leading 
audio data in the MPEG2 stream is recorded in the 
management region as a part of the reproduction con- 
trol information. Accordingly, an optical disk for realizing 
reproduction of high quality digital audio data along with 
the video data in a restricted range of bit rates is pro- 
vided. Moreover, a certain interval between audio repro- 
ductions can be maintained even in an inexpensive 
reproduction apparatus including no video data repro- 
duction function. This allows title creators to create date 
easily. 

[0316] Since a plurality of still pictures, sub-picture 
and menu can be displayed synchronously with or asyn- 
chronously from high quality audio data, a wider variety 
of displays become available. 

[0317] According to the present invention, reproduc- 
tion order information defining the order of the video 
streams to be reproduced synchronously with an audio 
sequence is recorded on a recording medium. By 
changing the contents of the reproduction order infor- 
mation, the contents of the video streams reproduced 
synchronously with the audio sequence can easily be 
changed. This provides the title creators with the free- 
dom of creating a variety of video streams for one audio 
sequence, and provides the user with the freedom of 
selecting a desired video stream among a plurality of 
video streams offered for one audio sequence. 
[0318] According to the present invention, video 
reproduction mode information is recorded on the 
recording medium. Whether it is possible or not to 
change the order of the video stream to be reproduced 
in synchronization with the audio sequence based on 
interaction from a user is adjusted by the video repro- 
duction mode information. The video reproduction 
mode information enables the switching of the repro- 
duction timing/reproduction time period of the video 
stream to be reproduced synchronously with the audio 
sequence. Thus, the two different types of video repro- 
duction modes, i.e., "SlideShow" and "Browsable" can 
be offered to the user. 

[031 9] According to the present invention, prior to the 
reproduction of the audio sequence, a video stream to 
be reproduced synchronously with the audio sequence 
is buffered in the buffer provided in the reproduction 
apparatus. By guaranteeing the range of the recording 
medium to read to the buffer, reproduction by any repro- 
duction apparatus is guaranteed. As a result, title crea- 
tors can easily assume the situation in which titles are 
reproduced, which facilitates the production of the titles. 
Consequently, the high quality titles can be supplied at 
lower cost. 

[0320] According to the present invention, a noh- 
MPEG stream can be decoded by an MPEG2 decoder. 
Thus, the reproduction apparatus can be provided at 
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lower cost. 

[0321] Various other modifications will be apparent to 
and can be readily made by those skilled in the art with- 
out departing from the scope and spirit of this invention. 
Accordingly, it is not intended that the scope of the 
claims appended hereto be limited to the description as 
set forth herein, but rather that the claims be broadly 
construed. 

Claims 

1 . A data storage medium having stored thereon: 

an audio sequence including at least one audio 
stream; 

at least one video stream; and 
reproduction control information tor controlling 
reproduction of the at least one audio stream 
and the at least one video stream, 
wherein the reproduction control information 
includes reproduction order information defin- 
ing the order of at least one video stream, 
among the at least one video stream stored on 
the data storage medium, which is to be repro- 
duced in synchronization with the audio 
sequence. 

2. A data storage medium according to claim 1, 
wherein the reproduction control information further 
includes video reproduction mode information indi- 
cating whether it is possible or not to change the 
order of the at least one video stream to be repro- 
duced in synchronisation with the audio sequence 
based on interaction from a user. 

3. A data storage medium according to claim 1, 
wherein the reproduction control information further 
includes video reproduction mode information indi- 
cating whether reproduction timing of the at least 
one video stream to be reproduced in synchroniza- 
tion with the audio sequence is defined or a repro- 
duction time period thereof is defined. 

4. A data storage medium according to claim 1, 
wherein the reproduction control information further 
includes reproduction order mode information 
defining whether the at least one video stream, the 
order of which is defined by the reproduction order 
information, is reproduced randomly or sequen- 
tially. 

5. A data storage medium according to claim 4, 
wherein the reproduction order mode information 
indicates whether the at least one video stream is 
reproduced randomly while permitting overlap or 
randomly while prohibiting overlap. 

6. A data storage medium according to claim 1, 



wherein the audio sequence corresponds to at least 
one program, and the at least one video stream 
stored on the data storage medium is grouped on a 
program-by-program basis. 

5 

7. An apparatus for reproducing information from a 
data storage medium having stored thereon an 
audio sequence including at least one audio 
stream; at least one video stream; and reproduction 

10 control information for controlling reproduction of 
the at least one audio stream and the at least one 
video stream, wherein the reproduction control 
information includes reproduction order information 
defining the order of at least one video stream, 

is among the at least one video stream stored on the 
data storage medium, which is to be reproduced in 
synchronization with the audio sequence, the appa- 
ratus comprising: 

20 a reading section for reading the audio 

sequence and the at least one video stream 
stored on the data storage medium; and 
a reproduction section for reproducing the at 
least one video stream in synchronization with 

25 the audio sequence in the order defined by 

reproduction order information. 

8. An apparatus according to claim 7, wherein the 
reproduction control information further includes 

30 video reproduction mode information indicating 
whether iris possible or norto change the orderof 
the at least one video stream to be reproduced in 
synchronisation with the audio sequence based on 
interaction from a user. 

35 

9. An apparatus according to claim 7, wherein the 
reproduction control information further includes 
video reproduction mode information indicating 
whether reproduction timing of the at least one 

40 video stream to be reproduced in synchronization 
with the audio sequence is defined or a reproduc- 
tion time period thereof is defined. 

10. An apparatus according to claim 7, wherein the 
45 reproduction control information further includes 

reproduction order mode information defining 
whether the at least one video stream, the order of 
which is defined by the reproduction order informa- 
tion, is reproduced randomly or sequentially. 

so 

11. An apparatus according to claim 10, wherein the 
reproduction order mode information indicates 
whether the at least one video stream is repro- 
duced randomly while permitting overlap or ran- 

55 domly while prohibiting overlap. 

12. An apparatus according to claim 7, wherein the at 
least one video stream to be reproduced in syn- 
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chronization with the audio sequence is buffered 
into a buffer located in the reproduction apparatus 
before reproduction of the audio sequence. 

13. A method for reproducing information from a data 
storage medium having stored thereon an audio 
sequence including at least one audio stream; at 
least one video stream; and reproduction control 
information for controlling reproduction of the at 
least one audio stream and the at least one video 
stream, wherein the reproduction control informa- 
tion includes reproduction order information defin- 
ing the order of at least one video stream, among 
the at least one video stream stored on the data 
storage medium, which is to be reproduced in syn- 
chronisation with the audio sequence, the method 
comprising the steps of: 

reading the audio sequence and the at least 
one video stream stored on the data storage 
medium; and 

reproducing the at least one video stream in 
synchronization with the audio sequence in the 
order defined by reproduction order informa- 
tion. 

14. A method according to claim 13, wherein the repro- 
duction control information further includes video 
reproduction mode information indicating whether it 
is possible or not to change the order of the at least 
one video stream to be reproduced in synchroniza- 
tion with the audio sequence based on interaction 
from a user. 

15. A method according to claim 13, wherein the repro- 
duction control information further includes video 
reproduction mode information indicating whether 
reproduction timing of the at least one video stream 
to be reproduced in synchronization with the audio 
sequence is defined or a reproduction time period 
thereof is defined. 

16. A method according to claim 13, wherein the repro- 
duction control information further includes repro- 
duction order mode information defining whether 
the at least one video stream, the order of which is 
defined by the reproduction order information, is 
reproduced randomly or sequentially. 

17. A method according to claim 16, wherein the repro- 
duction order mode information indicates whether 
the at least one video stream is reproduced ran- 
domly while permitting overlap or randomly while 
prohibiting overlap. 

18. A method according to claim 13, wherein the at 
least one video stream to be reproduced in syn- 
chronization with the audio sequence is buffered 



into a buffer located in the reproduction apparatus 
before reproduction of the audio sequence. 

19. A data storage medium stored with a first system 
5 stream and a second system stream to be repro- 
duced in synchronization with each other, wherein: 

the first system stream includes reproduction 
control information indicating timing at which 
10 the first system stream is reproduced, and 

the second system stream is reproduced at 
timing determined by information other than 
information included in the second system 
stream. 

15 

20. An apparatus for reproducing a data storage 
medium stored with a first system stream and a 
second system stream to be reproduced in syn- 
chronization with each other, wherein the first sys- 

20 tern stream includes reproduction control 
information indicating timing at which the first sys- 
tem stream is reproduced, the apparatus compris- 
ing: 

25 a reading section for reading the first system 

stream and the second system stream stored 
on the data storage medium; and 
a decoder section for decoding the first system 
stream in accordance with the reproduction 

30 control information, 

wherein the second system stream is decoded 
in accordance with a control signal supplied 
from outside of the decoder section. 

35 21. A method for reproducing a data storage medium 
stored with a first system stream and a second sys- 
tem stream to be reproduced in synchronization 
with each other, wherein the first system stream 
includes reproduction control information indicating 

40 timing at which the first system stream is repro- 
duced, the method comprising the steps of: 

reading the first system stream and the second 

system stream stored on the data storage 
45 medium; and 

controlling the timing at which the first system 

stream is reproduced in accordance with the 

reproduction control information; 

controlling the timing at which the second sys- 
so tern stream is reproduced in accordance with 

information other than information included in 

the second system stream. 
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